Yaping Zhang (张亚萍)

I am an associate researcher at State Key Laboratory of Multimodal Artificial Intelligence Systems(MAIS), Institute of Automation, Chinese Academy of Sciences (CASIA). I received the Ph.D degree from CASIA in 2020. My research primarily centers on multimodal learning, document analysis and natural language processing. Recently, we aim to establish a new pathway toward document image translation with complex layout.


News

  • One paper is accepted by NACCAL 2024
  • One paper is accepted by COLING 2024
  • One paper is accepted by ICASSP 2024
  • One paper is accepted by IEEE TASLP
  • Two papers are accepted by EMNLP 2023

Selected Publications

  1. Yaping Zhang, Shuai Nie, Wenju Liu, Xing Xu, Dongxiang Zhang, Heng Tao Shen. Sequence-to-sequence domain adaptation network for robust text image recognition. Proceedings of the IEEE/CVF conference on CVPR. 2019. (CCF A)

  2. Yaping Zhang, Shuai Nie, Shan Liang, Wenju Liu. Robust text image recognition via adversarial sequence-to-sequence domain adaptation. IEEE trans. on Image Processing. 2021. (CCF A)

  3. Yupu Liang, Yaping Zhang, Cong Ma, Zhiyang Zhang, Yang Zhao, Lu Xiang, Chengqing Zong, Yu Zhou. Document Image Machine Translation with Dynamic Multi-pre-trained Models Assembling. In The 2024 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2024). Mexico City, Mexico. June 16-21, 2024. (CCF B)

  4. Cong Ma, Yaping Zhang, Zhiyang Zhang, Yupu Liang, Yang Zhao, Yu Zhou, Chengqing Zong. Born a BabNet with Hierarchical Parental Supervision for End-to-End Text Image Machine Translation. In The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024). Torino, Italia. May 20-25, 2024. (CCF B)

  5. Zhiyang Zhang, Yaping Zhang, Yupu Liang, Lu Xiang, Yang Zhao, Yu Zhou, Chengqing Zong. LayoutDIT: Layout-Aware End-to-End Document Image Translation with Multi-Step Conductive Decoder. In Findings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023), Singapore. December 6-10, 2023. pp. 4959–4965. ACL_Anthology_version. (CCF B)

  6. Cong Ma, Xu Han, Linghui Wu, Yaping Zhang, Yang Zhao, Yu Zhou, and Chengqing Zong. Modal Contrastive Learning based End-to-End Text Image Machine Translation. IEEE/ACM Transactions on Audio, Speech, and Language Processing. IEEEXplore Early Access. (CCF B)

  7. Cong Ma, Yaping Zhang, Mei Tu, Yang Zhao, Yu Zhou, Chengqing Zong. CCIM: Cross-Modal Cross-Lingual Interactive Image Translation. In Findings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023), Singapore. December 6-10, 2023. pp. 4959–4965. ACL_Anthology_version. (CCF B)

More papers


Projects

  1. 国家自然科学基金委员会, 重点项目, 62336008, 大规模多语种多模态神经机器翻译关键技术研究, 2024-01-01 至 2028-12-31, 233万元, 在研, 参与

  2. 国家自然科学基金委员会, 青年科学基金项目, 62106265, 融合视觉与文本信息的图像文本机器翻译方法研究, 2022-01-01 至 2024-12-31, 30万元, 在研, 主持


Awards

  • 2023, Best Paper Award for the 19th China Conference on Machine Translation, CCMT2023

  • 2018, Best Student Paper Award for International Conference on Acoustics, Speech, and Signal Processing (ICASSP)


Academic Activities

  • Committee Member, Chinese Society of Image and Graphics, Document Image Analysis and Recognition(2023-)

  • Committee Member, Chinese Society of Image and Graphics, Big Vision Data Committee(2023-)

  • Committee Member, Chinese Information Processing Society of China, Youth Working Committee(2023-)

  • Reviewers for AAAI, ACL, EMNLP, NACCAL, ICME, ICASSP, CVMJ, TALLIP, etc