Yuxiang Wang

I am a first-year Master’s student at the School of Data Science, Fudan University, where I am very fortunate to be advised by Prof. Baojian Zhou at the Knowledge Works Lab.

I received my B.Eng. in Computer Science (Honors Class) from the Zhuoyue Honor College, Hangzhou Dianzi University in June 2025, where I worked with Prof. Wang.

Research Interests

My research interests span:

LLMs (Agents) — post-training, tool use, multi-turn reasoning (Thought→Action→Observation), and RL for agents
Diffusion Language Models — locality-aware architectures, training stability, and the inductive-bias / task-structure relationship
AI4S — protein/small-molecule modeling, unified evaluation frameworks, and Protein Agents
Multimodal LLMs — unified understanding & generation, e.g. protein ↔ text bidirectional modeling
Graph Neural Networks — dynamic-graph reasoning, fraud detection on spatio-temporal graphs

News

2026 — Sole first author on Locality-aware Diffusion Language Modeling (introducing the Scatter and Jigsaw blockwise architectures); second author on Semantic Diffusion Language Modeling — both studying training stability and architecture design of diffusion language models.
2025.10 — Joined Alibaba (AI4S / Multimodal LLM) as a research intern.
2025.09 — Started my MS at Fudan University, Knowledge Works Lab.
2025.06 — Graduated from HDU as an Outstanding Graduate; admitted to Fudan University via recommendation.

Selected Publications

Locality-aware Diffusion Language Modeling — Sole first author. Studies the trainability of Masked Diffusion LMs and proposes two locality-aware blockwise architectures (Scatter and Jigsaw) bridging AR and Diffusion regimes.
Spatio-Temporal Distance and Frame-Based Dynamic Graph Fraud Detection (STM) — First author, CCF-B venue. Frame-wise intra/inter aggregation over dynamic graphs achieves SOTA on multiple benchmarks. Resulted in an invention patent.
Semantic Diffusion Language Modeling (SemDLM) — Second author. Semantic-neighborhood diffusion + shared refresh branch reaches 27.19 Test PPL on LM1B.

You can find my full Curriculum Vitae for more details.

Open-Source Study Notes — Welcome to Learn!

I have open-sourced my study notes covering LLM, Multimodal, RL alignment, and NLP. Everyone is welcome to use them for learning and interview preparation!

Notes	Description	Link
LLM & Multimodal Interview Notes	Transformer, LLM Architecture, Training/RLHF/DPO, Multimodal (CLIP/LLaVA/GPT-4o), Engineering Practice	Online Reading
RL for LLM Alignment Quick Reference	Policy Gradient → PPO, GRPO, RLHF Pipeline, DPO Derivation, Algorithm Comparison	Online Reading
VLM Knowledge & Interview Guide (2025-2026)	Visual Encoder, VLM Architecture, Alignment Training, Resolution Strategy, MoE, Latest Progress	Online Reading
NLP & LLM Course Notes	Tokenization, N-gram, Embeddings, Neural LM, Transformer, GPT, BERT, RLHF (Fudan CS40008)	Online Reading