Brief Biography
I am an associate professor at Northeastern University (NEU), working in the natural language processing lab led by Prof. Jingbo Zhu. I received my PhD degree in computer science in July 2012 from Northeastern University (co-advisors: Prof. Jingbo Zhu and Prof. Keh-Yih Su). Before that, I worked on Example-Based Machine Translation (EBMT) and Chinese chunking with my advisors Prof. Li Zhang and Prof. Tianshun Yao. I also visited the Speech group at the University of Cambridge as a postdoctoral researcher in 2013-2014, where my advisor was Prof. Bill Byrne. Here is my CV.
Northeastern University is located in Shenyang (沈阳), China, where I grew up. Shenyang is really a wonderful place. I love it very much!
Interests
I am interested in NLP and machine learning, particularly in Machine Translation (MT). I hope to exploit models that are motivated by linguistic notions and can learn underlying structures of language.
Publications (full list)
2021:
Learning Light-Weight Translation Models from Deep Transformer, Bei Li, Ziyang Wang, Hui Liu, Quan Du, Tong Xiao, Chunliang Zhang, Jingbo Zhu.
In Proc. of the Thirty-Fifth AAAI Conference on Artificial Intelligence (AAAI), 2021, Virtual Event. [pdf]
An Efficient Transformer Decoder with Compressed Sub-layers, Yanyang Li, Ye Lin, Tong Xiao, Jingbo Zhu.
In Proc. of the Thirty-Fifth AAAI Conference on Artificial Intelligence (AAAI), 2021, Virtual Event. [pdf]
Weight Distillation: Transferring the Knowledge in Neural Network Parameters, Ye Lin, Yanyang Li, Ziyang Wang, Bei Li, Quan Du, Tong Xiao and Jingbo Zhu.
In Proc. of the 59th Annual Meeting of the Association for Computational Linguistics (ACL), 2021, Virtual Event. [pdf]
Stacked Acoustic-and-Textual Encoding: Integrating the Pre-trained Models into Speech Translation Encoders, Chen Xu, Bojie Hu, Yanyang Li, Yuhao Zhang, Shen Huang, Qi Ju, Tong Xiao, Jingbo Zhu.
In Proc. of the 59th Annual Meeting of the Association for Computational Linguistics (ACL), 2021, Virtual Event. [pdf]
Topology-Sensitive Neural Architecture Search for Language Modeling, Quan Du, Nuo Xu, Yinqiao Li, Tong Xiao, Jingbo Zhu.
In IEEE Access. [pdf]
2020:
Neural Machine Translation with Joint Representation, Yanyang Li, Qiang Wang, Tong Xiao, Tongran Liu, Jingbo Zhu.
In Proc. of the Thirty-Fourth AAAI Conference on Artificial Intelligence (AAAI), 2020, New York, USA. [pdf]
Learning Architectures from an Extended Search Space for Language Modeling, Yinqiao Li, Chi Hu, Yuhao Zhang, Nuo Xu, Yufan Jiang, Tong Xiao, Jingbo Zhu, Tongran Liu, Changliang Li.
In Proc. of the 58th Annual Meeting of the Association for Computational Linguistics (ACL), 2020, Seattle, USA. [pdf]
Does Multi-Encoder Help? A Case Study on Context-Aware Neural Machine Translation, Bei Li, Hui Liu, Ziyang Wang, Yufan Jiang, Tong Xiao, Jingbo Zhu, Tongran Liu, Changliang Li.
In Proc. of the 58th Annual Meeting of the Association for Computational Linguistics (ACL), 2020, Seattle, USA. [pdf]
Towards Fully 8-bit Integer Inference for the Transformer Model, Ye Lin, Yanyang Li, Tengbo Liu, Tong Xiao, Tongran Liu and Jingbo Zhu.
In Proc. of the 29th International Joint Conference on Artificial Intelligence and the 17th Pacific Rim International Conference on Artificial Intelligence, 2020, Yokohama, Japan. [pdf]
Shallow-to-Deep Training for Neural Machine Translation, Bei Li, Ziyang Wang, Hui Liu, Yufan Jiang, Quan Du, Tong Xiao, Huizhen Wang and Jingbo Zhu.
In Proc. of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020, Punta Cana, Dominican Republic. [To Appear]
Training Flexible Depth Model by Multi-Task Learning for Neural Machine Translation, Qiang Wang, Tong Xiao and Jingbo Zhu.
In Proc. of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020, Punta Cana, Dominican Republic. [To Appear]
A Simple and Effective Approach to Robust Unsupervised Bilingual Dictionary Induction, Yanyang Li, Yingfeng Luo, Ye Lin, Quan Du, Huizhen Wang, Tong Xiao and Jingbo Zhu.
In Proc. of the 28th International Conference on Computational Linguistics, 2020, Barcelona, Spain. [To Appear]
Layer-wise Multi-view Learning for Neural Machine Translation, Qiang Wang, Yue Zhang, Tong Xiao and Jingbo Zhu.
In Proc. of the 28th International Conference on Computational Linguistics, 2020, Barcelona, Spain. [To Appear]
Dynamic Curriculum Learning for Low-Resource Neural Machine Translation, Chen Xu, Bojie Hu, Yufan Jiang, Kai Feng, Zeyang Wang, Shen Huang, Qi Ju, Tong Xiao and Jingbo Zhu.
In Proc. of the 28th International Conference on Computational Linguistics, 2020, Barcelona, Spain. [To Appear]
2019:
Learning Deep Transformer Models for Machine Translation, Qiang Wang, Bei Li, Tong Xiao, Jingbo Zhu, Changliang Li, Derek F. Wong, Lidia S. Chao.
In Proc. of the 57th Annual Meeting of the Association for Computational Linguistics (ACL), 2019, Florence, Italy. [pdf]
Sharing Attention Weights for Fast Transformer, Tong Xiao, Yinqiao Li, Jingbo Zhu, Zhengtao Yu and Tongran Liu.
In Proc. of the 28th International Joint Conference on Artificial Intelligence, Macao, China. [pdf]
The NiuTrans Machine Translation Systems for WMT19, Bei Li, Yinqiao Li, Chen Xu, Ye Lin, Jiqiang Liu, Hui Liu, Ziyang Wang, Yuhao Zhang, Nuo Xu, Zeyang Wang, Kai Feng, Hexuan Chen, Tengbo Liu, Yanyang Li, Qiang Wang, Tong Xiao and Jingbo Zhu.
In Proc. of the Fourth Conference on Machine Translation (WMT), Florence, Italy, 2019. [pdf]
Research on inference acceleration method of Neural Machine Translation system based on coarse2fine, Yuhao Zhang, Nuo Xu, Yinqiao Li, Tong Xiao and Jingbo Zhu.
In Proc. of the 15th China Conference on Machine Translation (CCMT), Nanchang, China. [pdf]
NiuTrans Submission for CCMT19 Quality Estimation Task, Ziyang Wang, Hui Liu, Hexuan Chen, Kai Feng, Zeyang Wang, Bei Li, Chen Xu, Tong Xiao and Jingbo Zhu.
In Proc. of the 15th China Conference on Machine Translation (CCMT), Nanchang, China. [pdf]
Improved Differentiable Architecture Search for Language Modeling and Named Entity Recognition, Yufan Jiang, Chi Hu, Tong Xiao, Chunliang Zhang and Jingbo Zhu.
In Proc. of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP), Hong Kong, China. [pdf]
Analysis of Back-translation Methods for Low-Resource Neural Machine Translation, Nuo Xu, Yinqiao Li, Chen Xu, Yanyang Li, Bei Li, Tong Xiao and Jingbo Zhu.
In Proc. of the 8th CCF International Conference on Natural Language Processing and Chinese Computing (NLPCC), Dunhuang, China. [pdf]
2018:
A Simple and Effective Approach to Coverage-Aware Neural Machine Translation, Yanyang Li, Tong Xiao, Yinqiao Li, Qiang Wang, Changming Xu and Jingbo Zhu.
In Proc. of the fifty-sixth Annual Meeting of the Association for Computational Linguistics, Melbourne, Australia. [pdf]
Multi-layer Representation Fusion for Neural Machine Translation, Qiang Wang, Fuxue Li, Tong Xiao, Yanyang Li, Yinqiao Li and Jingbo Zhu.
In Proc. of the 27th International Conference on Computational Linguistics, Santa Fe, New Mexico, USA, 2018. [pdf]
Towards Building a Strong Transformer Neural Machine Translation System, Qiang Wang, Bei Li, Jiqiang Liu, Bojian Jiang, Zheyang Zhang, Yinqiao Li, Ye Lin, Tong Xiao and Jingbo Zhu.
In Proc. of the 14th China Workshop on Machine Translation, Wuyishan, China, 2018. [pdf]
Learning Neuron Connections for Language Models, Yufan Jiang, Bei Li, Ye Lin, Yinqiao Li, Tong Xiao and Jingbo Zhu.
In Proc. of the 14th China Workshop on Machine Translation (in Chinese), Wuyishan, China, 2018. [pdf]
On Ensemble Learning of Neural Machine Translation, Bei Li, Qiang Wang, Yufan Jiang, Zheyang Zhang, Jiqiang Liu, Li Zhang and Tong Xiao.
In Proc. of the Seventeenth China National Conference on Computational Linguistics (in Chinese), Changsha, China. [pdf]
On Storage Compression of Neural Machine Translation, Ye Lin, Yufan Jiang, Hengyu Li and Tong Xiao.
In Proc. of the Seventeenth China National Conference on Computational Linguistics (in Chinese), Changsha, China. [pdf]
2017:
Fast Parallel Training of Neural Language Models, Tong Xiao, Jingbo Zhu, Tongran Liu and Chunliang Zhang.
In Proc. of the twenty-sixth International Joint Conference on Artificial Intelligence, Melbourne, Australia. [pdf]
Towards Bidirectional Hierarchical Representations for Attention-based Neural Machine Translation, Baosong Yang, Derek F. Wong, Tong Xiao, Lidia S. Chao, Jingbo Zhu.
In Proc. of Conference on Empirical Methods in Natural Language Processing 2017, Copenhagen, Denmark. [pdf]
Analysis of Data Parallel Methods in Training Neural Language Models via Multiple GPUs, Yinqiao Li, Ambyer Han, Le Bo, Tong Xiao, Jingbo Zhu, Li Zhang.
In Proc. of the 13th China Workshop on Machine Translation (in Chinese), 2017, Dalian, China. [pdf]
2016:
Syntactic Skeleton-based Translation, Tong Xiao, Jingbo Zhu, Chunliang Zhang and Tongran Liu.
In Proc. of the thirtieth AAAI conference, Phoenix, USA. [pdf]
On Decoding with Augmented Hierarchical Phrase-based Translation Models using Tree-to-String Models, Tong Xiao and Jingbo Zhu.
To appear in Chinese Journal of Computers (in Chinese).
A Loss-Augmented Approach to Training Syntactic Machine Translation Systems, Tong Xiao, Derek F. Wong, Jingbo Zhu.
In IEEE/ACM Transactions on Audio, Speech and Language Processing. [pdf]
2015:
Improving Syntactic Rule Extraction through Deleting Spurious Links with Translation Span Alignment, Jingbo Zhu, Qiang Li and Tong Xiao.
Natural Language Engineering, 21(2):227-249. [pdf]
A Comparison of Pruning Methods for CYK-based Decoding in Machine Translation, Yuze Gao and Tong Xiao.
In Proc. of the 11th China Workshop on Machine Translation, Hefei, China.
2014:
Effective Incorporation of Source Syntax into Hierarchical Phrase-based Translation, Tong Xiao, Adrià de Gispert, Jingbo Zhu and Bill Byrne.
In Proc. of the 25th International Conference on Computational Linguistics, Dublin, Ireland. [pdf]
A Hybrid Approach to Skeleton-based Translation, Tong Xiao, Jingbo Zhu and Chunliang Zhang.
In Proc. of the 52nd Annual Meeting of the Association for Computational Linguistics (ACL, short papers), Baltimore, USA. [pdf]
2013:
Unsupervised Sub-tree Alignment for Tree-to-tree Translation, Tong Xiao and Jingbo Zhu.
In Journal of Artificial Intelligence Research (JAIR), Volume 48, 2013. [pdf]
NiuTrans Open-source Statistical Machine Translation System V1.3.0, Qiang Li, Kunjie Sun, Zhuo Liu, Tong Xiao and Jingbo Zhu
In Proc. of CWMT 2013 (in Chinese), 2013, Kunming, China. [pdf]
Chinese Sentence Compression: Corpus and Evaluation, Chunliang Zhang, Minghan Hu, Tong Xiao and Jingbo Zhu
In Proc. of Chinese Computational Linguistics (CCL) and Natural Language Processing Based on Naturally Annotated Big Data, 2013, Beijing, China. [pdf (from the publisher)]
Easy-First POS Tagging and Dependency Parsing with Beam Search, Ji Ma, Jingbo Zhu, Tong Xiao and Nan Yang.
In Proc. of the 51st Annual Meeting of the Association for Computational Linguistics (ACL, short paper), 2013, Sofia, Bulgaria. [pdf]
Bagging and Boosting Statistical Machine Translation Systems, Tong Xiao, Jingbo Zhu and Tongran Liu.
Artificial Intelligence (AI), Volume 195, February 2013. [pdf (from the publisher)]
2012:
Easy-First Chinese POS Tagging and Dependency Parsing, Ji Ma, Tong Xiao, Jingbo Zhu and Feiliang Ren.
In Proc. of the 24th International Conference on Computational Linguistics (COLING), 2012, Mumbai, India. [pdf]
NiuTrans: An Open Source Toolkit for Phrase-based and Syntax-based Machine Translation, Tong Xiao, Jingbo Zhu, Hao Zhang and Qiang Li
In Proc. of The 50th Annual Meeting of the Association for Computational Linguistics (ACL, demonstration session), 2012, Jeju, Korea. [pdf]
Learning Better Rule Extraction with Translation Span Alignment, Jingbo Zhu, Tong Xiao and Chunliang Zhang
In Proc. of The 50th Annual Meeting of the Association for Computational Linguistics (ACL, short paper), 2012, Jeju, Korea. [pdf]
Learning Compact Translation Models by Composing-Based Phrase Extraction, Qiang Li, Yongbai Gao, Tong Xiao, Hao Zhang and Jingbo Zhu
In Proc. of the 6th Youth Conference of Computational Linguistics (YCCL, in Chinese), 2012, Shanghai, China. [pdf, code]
Single-Model System Combination for Shift-Reduce Parsers, Ji Ma, Muhua Zhu, Tong Xiao and Jingbo Zhu
In Journal of Chinese Information Processing (in Chinese), 2012(3).
2011:
In Proc. of MT summit XIII, 2011, Xiamen, China. [pdf]
Improving Decoding Generalization for Tree-to-String Translation, Jingbo Zhu and Tong Xiao
In Proc. of The 49th Annual Meeting of the Association for Computational Linguistics (ACL, short paper), 2011, Portland, USA. [pdf]
Automatic Treebank Conversion via Informed Decoding - A Case Study on Chinese Treebanks, Muhua Zhu, Jingbo Zhu and Tong Xiao
In ACM Transactions on Asian Language Information Processing (TALIP), Special Issue on Chinese Language Processing [pdf (from the publisher)]
Language Modeling for Syntax-based Machine Translation Using Tree Substitution Grammars: A Case Study on Chinese-English Translation, Tong Xiao, Jingbo Zhu and Muhua Zhu
In ACM Transactions on Asian Language Information Processing (TALIP), Special Issue on Chinese Language Processing [pdf (from the publisher)]
Selection of SMT Training Data Based on Sentence Pair Quality and Coverage, Shujie Yao, Tong Xiao and Jingbo Zhu
In Journal of Chinese Information Processing (in Chinese), 2011(2).
2010:
An Empirical Study of Translation Rule Extraction with Multiple Parsers, Tong Xiao, Jingbo Zhu, Hao Zhang and Muhua Zhu
In Proc. of The 23rd International Conference on Computational Linguistics (COLING, poster session), 2010, Beijing, China. [pdf]
Heterogeneous Parsing via Collaborative Decoding, Muhua Zhu, Jingbo Zhu and Tong Xiao
In Proc. of The 23rd International Conference on Computational Linguistics (COLING), 2010, Beijing, China. [pdf]
Boosting-based System Combination for Machine Translation, Tong Xiao, Jingbo Zhu, Muhua Zhu and Huizhen Wang
In Proc. of The 48th Annual Meeting of the Association for Computational Linguistics (ACL), 2010, Uppsala, Sweden. [pdf]
The impact of parsing accuracy on syntax-based SMT, Hao Zhang, Tong Xiao and Jingbo Zhu
In Proc. of IEEE NLPKE, 2010, Beijing, China. [pdf (from the publisher)]
Word Re-alignment for Statistical Machine Translation, Tong Xiao, Tianning Li, Rushan Chen, Jingbo Zhu and Huizhen Wang
In Journal of Chinese Information Processing (in Chinese), 2010(1): 110-116.
2009:
Better Synchronous Binarization for Machine Translation, Tong Xiao, Mu Li, Dongdong Zhang, Jingbo Zhu and Ming Zhou
In Proc. of Empirical Methods in Natural Language Processing (EMNLP), 2009, Singapore. [pdf]
The Feature Subspace Method for SMT System Combination, Nan Duan, Mu Li, Tong Xiao and Ming Zhou
In Proc. of Empirical Methods in Natural Language Processing (EMNLP), 2009, Singapore. [pdf]
Competitions
WMT19 Kazakh-English, English-Kazakh, Gujarati-English, German-Czech, Czech-German, Russian-English, English-Russian, Chinese-English, German-English, English-German, Lithuanian-English MT track - 1st place, 1st place, 1st place, 2nd place, 2nd place, 2nd place, 3rd place, 3rd place, 3rd place, 3rd place and 3rd place (BLEU)
The NiuTrans Machine Translation Systems for WMT19, Bei Li, Yinqiao Li, Chen Xu, Ye Lin, Jiqiang Liu, Hui Liu, Ziyang Wang, Yuhao Zhang, Nuo Xu, Zeyang Wang, Kai Feng, Hexuan Chen, Tengbo Liu, Yanyang Li, Qiang Wang, Tong Xiao and Jingbo Zhu.
In Proc. of the Fourth Conference on Machine Translation (WMT), Florence, Italy, 2019. [pdf]
WMT18 Chinese-English, English-Chinese MT track - 2nd place and 2nd place (BLEU)
The NiuTrans Machine Translation System for WMT18, Qiang Wang, Bei Li, Jiqiang Liu, Bojian Jiang, Zheyang Zhang, Yinqiao Li, Ye Lin, Tong Xiao and Jingbo Zhu.
In Proc. of the Third Conference on Machine Translation (WMT), Brussels, Belgium, 2018. [pdf]
CWMT18 Chinese-English, English-Chinese MT track - 1st place and 2nd place (BLEU)
The NiuTrans Machine Translation System for CWMT-2018, Qiang Wang, Bei Li, Jiqiang Liu, Bojian Jiang, Zheyang Zhang, Yinqiao Li, Ye Lin, Tong Xiao and Jingbo Zhu.
In Proc. of the Fourteenth China workshop on Machine Translation (CWMT), Fujian, China, 2018. [pdf]
WMT13 Russian-English MT track - 2nd/1st place (case-insensitive/sensitive BLEU)
The University of Cambridge Russian-English System at WMT13, Juan Pino, Aurelien Waite, Tong Xiao, Adrià de Gispert, Federico Flego and William Byrne
In Proc. of the Eighth Workshop on Statistical Machine Translation (WMT), 2013, Sofia, Bulgaria. [pdf]
NTCIR-9 Chinese-English patent MT track - 2nd place (human evaluation)
The NiuTrans Machine Translation System for NTCIR-9, Tong Xiao, Qiang Li, Qi Lu, Hao Zhang, Haibo Ding, Shujie Yao, Xiaoming Xu, Xiaoxu Fei, Jingbo Zhu, Feiliang Ren and Huizhen Wang
In Proc. of NTCIR-9 Workshop Meeting, 2011, Tokyo, Japan. [pdf]
CWMT2011 English-Chinese and Chinese-English news tracks - 1st place and 4th place (BLEU)
The NiuTrans Machine Translation System for CWMT2011, Tong Xiao, Hao Zhang, Qiang Li, Qi Lu, Jingbo Zhu, Feiliang Ren and Huizhen Wang
In Proc. of The 6th China workshop on Machine Translation (CWMT), 2011, Xiamen, China. [pdf]
CWMT2009 Chinese-English Single System Track - 2nd place (BLEU)
NEUTrans: a Phrase-Based SMT System for CWMT2009, Tong Xiao, Rushan Chen, Tianning Li, Muhua Zhu, Jingbo Zhu, Huizhen Wang and Feiliang Ren
In Proc. of The 5th China workshop on Machine Translation (CWMT), 2009, Nanjing, China. [pdf]
NTCIR-7 English Patent Mining Track - 1st place (MAP)
KNN and Re-ranking Models for English Patent Mining at NTCIR-7, Tong Xiao, Feifei Cao, Tianning Li, Guolong Song, Ke Zhou, Jingbo Zhu and Huizhen Wang
In Proc. of NTCIR-7 Workshop Meeting, 2008, Tokyo, Japan. [pdf]
Software
NiuTrans.SMT: an open-source MT system. Many useful features can be found in NiuTrans.SMT. Check it out!
Professional Activities
PC member/reviewer for ACL2013/2014/2015/2016/2017/2018/2019, COLING2014/2016/2018, EMNLP2016/2017/2018/2019, NAACL2018/2019, AAAI2017/2018/2019, IJCAI2018/2019, CWMT2013-2019, IJCNLP2011/2013/2015/2017, CCIR2013-2019
Reviewer for IEEE Transactions on Audio, Speech and Language Processing (2017-2019)
Reviewer for ACM Transactions on Asian and Low-Resource Language Information Processing (2018-2019)
Reviewer for Journal of the Acoustical Society of America (2011-2012)
Reviewer for International Journal of Pattern Recognition and Artificial Intelligence (2012-2013)
Reviewer for International Journal of Computational Linguistics and Chinese Language Processing (2007-2009)
Reviewer for International Journal of Computer Processing Of Languages (2010-2012)
Reviewer for Chinese Journal of Electronics (2012-2018)
Reviewer for Journal of Software (2018-2019)
Reviewer for ACTA AUTOMATICA SINICA (2016-2019)
Beyond Academics
I like playing basketball, hiking, and climbing mountains.
Finding my sweetheart, Tongran Liu, was the luckiest thing that ever happened to me.
~ Have fun!
Page last modified on Aug 4, 2019