Yanhui Li

PhD, Assistant Professor (PhD Supervisor)
Software Institute
Nanjing University
Room 926, FeiYimin Building, Nanjing University
Nanjing, China 210023
Email: yanhuili [@] nju [DOT] edu [DOT] cn

About me

I received the BS, MS, and PhD degrees in Computer Science from Southeast University, China. I am currently an assistant research professor in the Software Institute at Nanjing University. My main research interests include intelligent software testing, empirical software engineering, software analysis, knowledge engineering and formal methods. I am a member of the IEEE and the ACM.

Our research group is actively looking for PhD, master's, and undergraduate students. If you are interested in Software Engineering for AI, including testing and debugging of AI/DL systems, DL mutation testing, and empirical studies of the development and deployment of AI/DL systems, don't hesitate to contact me (yanhuili@nju.edu.cn).

Teaching

  • Discrete Mathematics: Spring 2023

  • Data Structure and Financial Algorithm: Autumn 2016-2023

  • Algorithm Design and Analysis: Spring 2015, Spring 2016

Awards

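  • 2019: "My Favorite Teacher" award, 15th edition, Nanjing University (one of 9 recipients university-wide)

  • 2020: "My Favorite Teacher" award, 16th edition, Nanjing University (one of 7 recipients university-wide)

  • 2022: The course "Data Structure and Financial Algorithm" was named a "Good Course in My Mind" by graduating undergraduates of the Department of Computer Science and Technology, Nanjing University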

News

  • 2023-11-01. Journal paper accepted to IST: "Why and How Bug Blocking Relations are Breakable: An Empirical Study on Breakable Blocking Bugs"

  • 2023-10-30. Journal paper accepted to TSE: "Mitigating false positive static analysis warnings: Progress, challenges, and opportunities"

  • 2023-10-28. Journal paper accepted to JSME: "Just-in-time Identification for Cross Project Correlated Issues"

  • 2023-10-10. Conference paper accepted to ICSE’24: "Knowledge Graph Driven Inference Testing for Question Answering Software"

  • 2023-07-28. Conference paper accepted to FSE’23: "Semantic Test Repair for Web Applications"

  • 2023-07-18. Conference paper accepted to ASE’23: "Causality-Aided Trade-off Analysis for Machine Learning Fairness"

  • 2023-02-22. Conference paper accepted to ICPC’23: "How Well Static Type Checkers Work with Gradual Typing? A Case Study on Python"

  • 2023-01-16. Conference paper accepted to ISSTA’23: "Back Deduction based Testing for Word Sense Disambiguation Ability of Machine Translation Systems"

  • 2022-12-14. Journal paper accepted to TOSEM: "Code-line-level bugginess identification: How far have we come, and how far have we yet to go?"

  • 2022-10-17. Journal paper accepted to TSE: "Towards the Analysis and Completion of Syntactic Structure Ellipsis for Inline Comments"

  • 2022-07-12. Journal paper accepted to TSE: "Towards Better Dependency Management: A First Look at Dependency Smells in Python Projects"

  • 2022-05-17. Journal paper accepted to IST: "How Higher Order Mutant Testing Performs for Deep Learning Models: A Fine-grained Evaluation of Test Effectiveness and Efficiency Improved from Second-Order Mutant-Classification Tuples"

  • 2022-03-02. Journal paper accepted to TSE: "Inconsistent defect labels: essence, causes, and influence"

  • 2021-12-15. Journal paper accepted to TOSEM: "Mutant reduction evaluation: what is there and what is missing?"

  • 2021-12-03. Conference paper accepted to ICSE’22: "Training Data Debugging for the Fairness of Machine Learning Software"

Research

My research interests include

  • AI testing and debugging

  • Mutation testing

  • Empirical software engineering

  • Software analysis

  • Knowledge engineering

  • Formal methods

Recent Publications

* corresponding authors

2024

  1. Jun Wang, Yanhui Li*, Zhifei Chen, Lin Chen, Xiaofang Zhang, Yuming Zhou: "Knowledge Graph Driven Inference Testing for Question Answering Software". The 46th International Conference on Software Engineering (ICSE 2024). CCF-A

  2. Hao Ren, Yanhui Li*, Lin Chen, Yuming Zhou, Changhai Nie: "Why and How Bug Blocking Relations are Breakable: An Empirical Study on Breakable Blocking Bugs". Information and Software Technology, Volume 166, February 2024. CCF-B

2023

  1. Jun Wang, Yanhui Li*, Xiang Huang, Lin Chen, Xiaofang Zhang, Yuming Zhou: "Back Deduction based Testing for Word Sense Disambiguation Ability of Machine Translation Systems". The 32nd ACM SIGSOFT International Symposium on Software Testing and Analysis (ISSTA 2023). CCF-A

  2. Xiaofang Qi, Xiang Qian, Yanhui Li*: "Semantic Test Repair for Web Applications". The ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering (ESEC/FSE 2023). CCF-A

  3. Shiran Liu, Zhaoqiang Guo, Yanhui Li*, Chuanqi Wang, Lin Chen*, Zhongbin Sun, Yuming Zhou*, Baowen Xu: "Inconsistent defect labels: essence, causes, and influence". IEEE Trans. Software Eng. 49(2): 586-610 (2023). CCF-A

  4. Zhenlan Ji, Pingchuan Ma*, Shuai Wang*, Yanhui Li: "Causality-Aided Trade-off Analysis for Machine Learning Fairness". 38th IEEE/ACM International Conference on Automated Software Engineering (ASE 2023). CCF-A

  5. Yulu Cao, Lin Chen*, Wanwangying Ma, Yanhui Li, Yuming Zhou, Linzhang Wang: "Towards Better Dependency Management: A First Look at Dependency Smells in Python Projects". IEEE Trans. Software Eng. 49(4): 1741-1765 (2023). CCF-A

  6. Xiaowei Zhang, Weiqin Zou*, Lin Chen*, Yanhui Li, Yuming Zhou: "Towards the analysis and completion of syntactic structure ellipsis for inline comments". IEEE Trans. Software Eng. 49(4): 2285-2302 (2023). CCF-A

  7. Zhaoqiang Guo, Shiran Liu, Xutong Liu, Wei Lai, Mingliang Ma, Xu Zhang, Chao Ni, Yibiao Yang*, Yanhui Li, Lin Chen, Guoqiang Zhou*, Yuming Zhou*: "Code-line-level bugginess identification: How far have we come, and how far have we yet to go?". ACM Trans. Softw. Eng. Methodol. 32, 4, Article 102 (July 2023), 55 pages. CCF-A

  8. Wenjie Xu, Lin Chen*, Chenghao Su, Yimeng Guo, Yanhui Li, Yuming Zhou, Baowen Xu: "How Well Static Type Checkers Work with Gradual Typing? A Case Study on Python". The 31st ACM/IEEE International Conference on Program Comprehension (ICPC 2023). CCF-B

  9. Hao Ren, Yanhui Li*, Lin Chen, Yulu Cao, Xiaowei Zhang, Changhai Nie: "Just-in-time Identification for Cross Project Correlated Issues". Journal of Software: Evolution and Process. accepted. CCF-B

2022

  1. Yanhui Li, Linghan Meng, Lin Chen*, Li Yu, Di Wu, Yuming Zhou, Baowen Xu: "Training Data Debugging for the Fairness of Machine Learning Software". The 44th International Conference on Software Engineering (ICSE 2022). CCF-A

  2. Peng Zhang, Yanhui Li*, Wanwangying Ma, Yibiao Yang, Lin Chen*, Hongmin Lu, Yuming Zhou*, Baowen Xu*: "CBUA: A probabilistic, predictive, and practical approach for evaluating test suite effectiveness". IEEE Transactions on Software Engineering, vol. 48, no. 3, pp. 1067-1096, 1 March 2022, doi: 10.1109/TSE.2020.3010361. CCF-A

  3. Peng Zhang, Yang Wang, Xutong Liu, Yanhui Li*, Yibiao Yang, Ziyuan Wang, Xiaoyu Zhou, Lin Chen*, Yuming Zhou*: "Mutant reduction evaluation: what is there and what is missing?". ACM Trans. Softw. Eng. Methodol. 31, 4, Article 69 (October 2022), 46 pages. https://doi.org/10.1145/3522578. CCF-A

  4. Yanhui Li, Weijun Shen, Tengchao Wu, Lin Chen*, Di Wu, Yuming Zhou, Baowen Xu: "How Higher Order Mutant Testing Performs for Deep Learning Models: A Fine-grained Evaluation of Test Effectiveness and Efficiency Improved from Second-Order Mutant-Classification Tuples". Information and Software Technology 2022. CCF-B

2021

  1. Linghan Meng, Yanhui Li*, Lin Chen, Zhi Wang, Di Wu, Yuming Zhou, Baowen Xu: "Measuring Discrimination to Boost Comparative Testing for Multiple Deep Learning Models". 2021 IEEE/ACM 43rd International Conference on Software Engineering (ICSE), 2021, pp. 385-396. CCF-A

  2. Zhaoqiang Guo, Shiran Liu, Jinping Liu, Yanhui Li*, Lin Chen*, Hongmin Lu, Yuming Zhou*: "How far have we progressed in identifying self-admitted technical debts? A comprehensive empirical study". ACM Trans. Softw. Eng. Methodol. 30(4): 45:1-45:56 (2021). CCF-A

  3. Henghui Zhao, Yanhui Li*, Fanwei Liu, Xiaoyuan Xie, Lin Chen: "State and tendency: an empirical study of deep learning question&answer topics on stack overflow". Sci. China Inf. Sci. 64(11) (2021). CCF-A

  4. Weijun Shen, Yanhui Li*, Yuanlei Han, Lin Chen, Di Wu, Yuming Zhou, Baowen Xu: "Boundary Sampling to Boost Mutation Testing for Deep Learning Models". Information and Software Technology 2021. CCF-B

  5. Shiran Liu, Zhaoqiang Guo, Yanhui Li*, Hongmin Lu, Lin Chen, Lei Xu, Yuming Zhou*, Baowen Xu: "Prioritizing code documentation effort: Can we do it simpler but better?". Information and Software Technology, Volume 140, December 2021, 106686. CCF-B

2020

  1. Weijun Shen, Yanhui Li*, Lin Chen, Yuanlei Han, Yuming Zhou, Baowen Xu: "Multiple-Boundary Clustering and Prioritization to Promote Neural Network Retraining". 35th IEEE/ACM International Conference on Automated Software Engineering (ASE ’20). 410–422. CCF-A

  2. Chuanqi Wang, Yanhui Li*, Lin Chen, Wenchin Huang, Yuming Zhou, Baowen Xu: "Examining the effects of developer familiarity on bug fixing". Journal of Systems and Software. Volume 169 (2020). CCF-B

  3. Hao Ren, Yanhui Li*, Lin Chen*: "An Empirical Study on Critical Blocking Bugs". 28th IEEE/ACM International Conference on Program Comprehension (ICPC 2020): 72-82. CCF-B

  4. Zhaoqiang Guo, Yanhui Li, Wanwangying Ma, Yuming Zhou*, Hongmin Lu, Lin Chen, Baowen Xu: "Boosting crash-inducing change localization with rank-performance-based feature subset selection". Empir. Softw. Eng. 25(3): 1905-1950 (2020). CCF-B

  5. Zhifei Chen, Yanhui Li, Bihuan Chen, Wanwangying Ma, Lin Chen*, Baowen Xu*: "An Empirical Study on Dynamic Typing Related Practices in Python Systems". ICPC 2020: 83-93. CCF-B

Current Projects

  • "Model-based code automatic generation". Joint project with Nanjing University of Aeronautics and Astronautics. 2023.05-2024.05.

  • "Semantic based testing data efficacy measurement for deep learning models". National Natural Science Foundation of China (General Program). 2022.01-2025.12.

  • "Software quality assurance in the mixed language programming environment". Cooperation Fund of Huawei-Nanjing University Next Generation Programming Innovation Lab (YBN2019105178SW35). 2022.03-2023.02.