Publications

Zhiyang Zhang, Yaping Zhang, Yupu Liang, Cong Ma, Lu Xiang, Yang Zhao, Yu Zhou, and Chengqing Zong. Understand Layout and Translate Text: Unified Feature-Conductive End-to-End Document Image Translation. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI). Accepted. https://doi.org/10.1109/TPAMI.2025.3530998 (CCF A, Corresponding Author)
Yaping Zhang, Shuai Nie, Wenju Liu, Xing Xu, Dongxiang Zhang, Heng Tao Shen. Sequence-to-sequence domain adaptation network for robust text image recognition. Proceedings of the IEEE/CVF conference on CVPR. 2019. (CCF A)
Yaping Zhang, Shuai Nie, Shan Liang, Wenju Liu. Robust text image recognition via adversarial sequence-to-sequence domain adaptation. IEEE trans. on Image Processing. 2021. (CCF A)
Lu Xiang, Yang Zhao, Yaping Zhang, Chengqing Zong. A Survey of Large Language Models in Discipline-specific Research: Challenges, Methods and Opportunities. Studies in Informatics and Control, ISSN 1220-1766, vol. 34(1), pp. 5-24, 2025.https://doi.org/10.24846/v34i1y202501
Jing Ye, Lu Xiang, Yaping Zhang, Chengqing Zong. SweetieChat: A Strategy-Enhanced Role-playing Framework for Diverse Scenarios Handling Emotional Support Agent. Proceedings of the 31st International Conference on Computational Linguistics(COLING-2025). Abu Dhabi, UAE. pages 4646–4669.
Zhiyang Zhang, Yaping Zhang, Yupu Liang, Lu Xiang, Yang Zhao, Yu Zhou, Chengqing Zong. From Chaotic OCR Words to Coherent Document: A Fine-to-Coarse Zoom-Out Network for Complex-Layout Document Image Translation. Proceedings of the 31st International Conference on Computational Linguistics(COLING-2025). Abu Dhabi, UAE. pages 10877–10890.
Cong Ma, Xu Han, Linghui Wu, Yaping Zhang, Yang Zhao, Yu Zhou, and Chengqing Zong. Modal Contrastive Learning based End-to-End Text Image Machine Translation. IEEE/ACM Transactions on Audio, Speech, and Language Processing (IEEE/ACM TASLP), vol. 32, pp. 2153-2165, 2024, https://doi.org/10.1109/TASLP.2023.3324540
Yupu Liang, Yaping Zhang, Cong Ma, Zhiyang Zhang, Yang Zhao, Lu Xiang, Chengqing Zong, Yu Zhou. Document Image Machine Translation with Dynamic Multi-pre-trained Models Assembling. In The 2024 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2024). Mexico City, Mexico. June 16-21, 2024.
Cong Ma, Yaping Zhang, Zhiyang Zhang, Yupu Liang, Yang Zhao, Yu Zhou, Chengqing Zong. Born a BabNet with Hierarchical Parental Supervision for End-to-End Text Image Machine Translation. In The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024). Torino, Italia. May 20-25, 2024.
Cong Ma, Yaping Zhang, Yang Zhao, Yu Zhou, Chengqing Zong. Vector Quantization Knowledge Transfer for End-to-End Text Image Machine Translation. In The 49th IEEE International Conference on Acoustics, Speech, & Signal Processing (ICASSP 2024). COEX, Seoul, Korea. April 14-19, 2024. IEEE Xplore Version.
Zhiyang Zhang, Yaping Zhang, Yupu Liang, Lu Xiang, Yang Zhao, Yu Zhou, Chengqing Zong. A Novel Dataset and Benchmark Analysis on Document Image Translation. In Proceddings of the 2023 China Conference on Machine Translation (CCMT 2023).
Cong Ma, Xu Han, Linghui Wu, Yaping Zhang, Yang Zhao, Yu Zhou, and Chengqing Zong. Modal Contrastive Learning based End-to-End Text Image Machine Translation. IEEE/ACM Transactions on Audio, Speech, and Language Processing. IEEEXplore Early Access.
Zhiyang Zhang, Yaping Zhang, Lu Xiang, Yang Zhao, Yu Zhou, Chengqing Zong. LayoutDIT: Layout-Aware End-to-End Document Image Translation with Multi-Step Conductive Decoder. In Findings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023), Singapore. December 6-10, 2023. pp. 4959–4965. ACL_Anthology_version. (CCF B)
Cong Ma, Yaping Zhang, Mei Tu, Yang Zhao, Yu Zhou, Chengqing Zong. CCIM: Cross-Modal Cross-Lingual Interactive Image Translation. In Findings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023), Singapore. December 6-10, 2023. pp. 4959–4965. ACL_Anthology_version.
Cong Ma, Yaping Zhang, Mei Tu, Yang Zhao, Yu Zhou, and Chengqing Zong. E2TIMT: Efficient and Effective Modal Adapter for Text Image Machine Translation. In The 17th Document Analysis and Recognition (ICDAR 2023), San José, California, USA. August 21-26, 2023. pp 70–88. Cham. Springer Nature Switzerland. arXiv_version, Springer_Link
Cong Ma, Yaping Zhang, Mei Tu, Yang Zhao, Yu Zhou, and Chengqing Zong. Multi-Teacher Knowledge Distillation for End-to-End Text Image Machine Translation. In The 17th Document Analysis and Recognition (ICDAR 2023), San José, California, USA. August 21-26, 2023. pp. 484–501, Cham. Springer Nature Switzerland. (Oral Paper) arXiv_version, Springer_Link
Cong Ma, Yaping Zhang, Mei Tu, Xu Han, Linghui Wu, Yang Zhao, Yu Zhou. Improving End-to-End Text Image Translation From the Auxiliary Text Translation Task. In Proceedings of the 26th International Conference on Pattern Recognition (ICPR 2022), Virtually, Montréal Québec, Canada. August 21-25, 2022. pp.1664-1670. arXiv_version, ieeexplore_version, GitHub.
Yaping Zhang, Shan Liang, Shuai Nie, Wenju Liu, Shouye Peng. Robust offline handwritten character recognition through exploring writer-independent features under the guidance of printed data. Pattern Recognition Letters. 2018.
Yaping Zhang, Shuai Nie, Shan Liang, Wenju Liu. Bidirectional adversarial domain adaptation with semantic consistency. In Proceddings of the 2019 Pattern Recognition and Computer Vision (PRCV 2019).
Bin Liu, Shuai Nie, Yaping Zhang, Dengfeng Ke, Shan Liang, Wenju Liu. Boosting noise robustness of acoustic model via deep adversarial training. Proceedings of the ICASSP. 2018. (CCF B)

Patents

发明专利：CN114626392B, 端到端文本图像翻译模型训练方法, 授权时间：2023.02.21
发明专利：CN113011202B, 基于多任务训练的端到端图像文本翻译方法、系统、装置，授权时间：2023.07.25

Yaping Zhang

Publications

Patents