🧑‍🎓 About Me

I graduated in 2024 with a Bachelor’s degree in Artificial Intelligence from Chien-Shiung Wu College, Southeast University (SEU, 985 & 211). Afterwards, I was admitted without examination to the Master’s program at Beijing Electronic Science and Technology Institute (BESTI), where I initially considered a civil service career after encountering some early research setbacks. However, I am fortunate to be advised by Dr. Xiaojun Jia, a postdoctoral researcher at Nanyang Technological University, and to collaborate with Weixin Wang, Haoxuan Ma, and many other friends. Following a one-year internship at Alibaba Security under the mentorship of Dr. Ranjie Duan, where I have the privilege of working alongside colleagues whose guidance and collaboration have broadened my perspective and deepened my commitment to research, I am currently an AI Governance Research Intern at TeleAI mentored by Tianle Zhang, while also conducting research under the guidance of Prof. Xingjun Ma at Fudan University. These ongoing experiences continually inspire me and reinforce my determination to pursue a Ph.D. in trustworthy AI.

📝 Selected Papers

CVPR 2026 (Main)

EcoAlign: An Economically Rational Framework for Efficient LVLM Alignment [Paper]

Ruoxi Cheng, Haoxuan Ma, Teng Ma, Hongyi Zhang

The IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2026).

ICLR 2026

Inverse Reinforcement Learning with Dynamic Reward Scaling for LLM Alignment [Paper] [Code]

Ruoxi Cheng, Haoxuan Ma, Weixin Wang, Ranjie Duan, Jiexi Liu, Xiaoshuang Jia, Simeng Qin, Xiaochun Cao, Yang Liu, Xiaojun Jia

The Fourteenth International Conference on Learning Representations (ICLR 2026)

EMNLP 2025 (Main)

PBI-Attack: Prior-Guided Bimodal Interactive Black-Box Jailbreak Attack for Toxicity Maximization [Paper] [Code]

Ruoxi Cheng, Yizhong Ding, Shuirong Cao, Ranjie Duan, Xiaoshuang Jia, Shaowei Yuan, Zhiqiang Wang, Xiaojun Jia

The 2025 Conference on Empirical Methods in Natural Language Processing (EMNLP 2025) Main Conference

MM 2026 (Under Review)

AOD: Adversarial Orthogonal Disentanglement for LVLM Hallucination Mitigation

Ruoxi Cheng, Haoxuan Ma, Zhengfei Hai, Yiyan HUANG, Ranjie Duan, Tianle Zhang, Xu Yang, Ziyi Ye, Xingjun Ma

Under review at MM 2026.

Pattern Recognition 2026 (Under Review)

Membership Inference for Contrastive Pre-training Models with Text-only PII Queries

Ruoxi Cheng, Yizhong Ding, Hongyi Zhang, Yiyan Huang

Under review at Pattern Recognition 2026.

🎖 Honors and Awards

2025, First Prize (Rank 3), “Huawei Cup” National Cybersecurity Innovation Competition
2025, Second Prize (Top 12%), “Huawei Cup” China Graduate Mathematical Modeling Competition
2025, National Scholarship (Top 1%)
2025, Grand Prize (Rank 1), National Cybersecurity Attack and Defense Software Competition
2025, Second Prize (Top 5%), National Information Security Contest & “Great Wall Cup” Information Security Triathlon
2025, Second Prize, China’s Innovation Challenge on Artifcial Intelligence Application Scene (CICAS 2025)
2024 & 2025, First‑Class Academic Scholarship
2025, Second Prize, National Software Innovation Competition — North China Region
2024, Outstanding Undergraduate Thesis (Top 3%)
2021, Merit Student Award of Southeast University
2021, Second Prize, National Undergraduate Mathematics Competition
2018 & 2019, First Prize, Chinese Mathematical Olympiad — Jiangsu Province

📖 Educations

2024.09 – Present, M.S., Beijing Electronic Science and Technology Institute, Beijing, China
2020.09 – 2024.06, B.S., Chien-shiung Wu College, Southeast University, Nanjing, China

💻 Internships

TeleAI — Mar 2026 – present (Supervisor: Tianle Zhang)

Researching agent reasoning RL for safety and adversarial red teaming.

Alibaba Security — Feb 2025 – Mar 2026 (Supervisor: Ranjie Duan)

Engaged in alignment training and evaluation of LLMs, contributing to the technical report:
- Oyster-I: Beyond Refusal — Constructive Safety Alignment for Responsible Language Models (Alibaba AAIG) [paper] — Principal Contributor (Fifth Author)
Co-inventor on two Alibaba Innovation Proposal patents:
- User–Model Interactive Security Guidance Mechanism Based on Game Theory — Fifth Inventor
- A Method for Constructing Chinese–English Safety Evaluation Datasets Based on Inference Complexity Grading — Sixth Inventor

💃 Skills & Interests

Languages: English – IELTS 7.0 (Listening 7.5, Reading 8.0, Writing 6.5, Speaking 6.0); CET-6: 585
Programming: Python, C++, PyTorch, TensorFlow, MySQL, Navicat, SPSS
Security: NISP Level-2 Certification, administered by China Information Security Evaluation Center
Chinese Dance: Level-10 Excellence; First Prize, Solo Dance Competition, Wuxi, Jiangsu Province (2015, 2018)

Sep 2024 – now, Program Committee Member or Reviewer for conferences and journals, including NeurIPS, ACL, ICASSP, ICMR, and The Frontiers of Computer Science
Jul 2021 – Aug 2021, Village Elementary School Teaching, Xiangyang, Hubei – Volunteer, National Education Support Project
Jun 2019 – Sep 2021, Talented Youth Initiative, Peking University – Vice Leader of Applied Science & Engineering Study Group

ClustrMaps Globe