Edinburgh · ML · System

Yangshen Deng

Building scalable systems for Data & AI

I am a first-year PhD student in the MLSys CDT at the University of Edinburgh, advised by Prof. Luo Mai and Prof. Edoardo M. Ponti. My work builds scalable systems for data and AI.

I received my bachelor’s degree from BUPT in 2022 and my master’s degree from SUSTech in 2025, where I worked in DBGroup with Prof. Bo Tang. I helped found AlayaDB.AI, and have interned at Ant Group, GaussDB Huawei, RS3Lab@EPFL, and Systems@TUDa.

News

Mar 2026

Our paper "AlayaLaser" on on-disk vector search has been accepted to SIGMOD '26. We reveal a surprising shift from I/O-bound to compute-bound. See the paper for how on-disk vector search can be faster than in-memory search.

Apr 2025

Our paper "ParaGraph" was accepted to DaMoN '25 at SIGMOD.

Feb 2025

Our paper "AlayaDB" was accepted to the SIGMOD '25 industry track. AlayaDB is the first database for KV cache and attention.

Selected publications

SIGMOD 2026

Efficient Index Layout and Search Strategy for Large-scale High-dimensional Vector Similarity Search

Weijian Chen, Haotian Liu, Yangshen Deng, Long Xiang, Liang Huang, Gezi Li, Bo Tang

Proc. ACM Manag. Data, 2026, vol. 4

DaMoN 2025

ParaGraph: Accelerating Graph Indexing through GPU-CPU Parallel Processing for Efficient Cross-modal ANNS

Yuxiang Yang, Shiwen Chen, Yangshen Deng, Bo Tang

Proceedings of the 21st International Workshop on Data Management on New Hardware, 2025

SIGMOD 2025

AlayaDB: The Data Foundation for Efficient and Effective Long-context LLM Inference

Yangshen Deng, Zhengxin You, Long Xiang, Qilong Li, Peiqi Yuan, Zhaoyang Hong, Yitao Zheng, Wanting Li, Runzhong Li, Haotian Liu, Kyriakos Mouratidis, Man Lung Yiu, Huan Li, Qiaomu Shen, Rui Mao, Bo Tang

Companion of the 2025 International Conference on Management of Data, 2025, pp. 364–377

Jan 2025

I was awarded the BYD Scholarship.

Apr 2024

Our paper "How Does Software Prefetching Work on GPU Query Processing?" was accepted to DaMoN '24 at SIGMOD.

Mar 2024

Our paper "Accelerating Merkle Patricia Trie with GPU" was accepted to PVLDB '24.

Sep 2022

Our paper "GHive" was accepted to SoCC '22.

Feb 2022

Our demo paper "GHive" was accepted to the SIGMOD '22 demo track.

arXiv 2026

ECHO-2: A Large-Scale Distributed Rollout Framework for Cost-Efficient Reinforcement Learning

Jie Xiao, Meng Chen, Qingnan Ren, Jingwei Song, Jiaqi Huang, Yangshen Deng, Chris Tong, Wanyi Chen, Suli Wang, Ziqian Bi, Shuo Lu, Yiqun Duan, Xu Wang, Rymon Yu, Ween Yang, Lynn Ai, Eric Yang, Bill Shi

arXiv:2602.02192, 2026

arXiv 2025

NeuroTTT: Bridging Pretraining-Downstream Task Misalignment in EEG Foundation Models via Test-Time Training

Suli Wang, Yangshen Deng, Zhenghua Bao, Xinyu Zhan, Yiqun Duan

arXiv:2509.26301, 2025

DaMoN 2024

How Does Software Prefetching Work on GPU Query Processing?

Yangshen Deng, Shiwen Chen, Zhaoyang Hong, Bo Tang

Proceedings of the 20th International Workshop on Data Management on New Hardware, 2024

PVLDB 2024

Accelerating Merkle Patricia Trie with GPU

Yangshen Deng, Muxi Yan, Bo Tang

Proc. VLDB Endow., 2024, vol. 17, pp. 1856–1869

SoCC 2022

GHive: Accelerating Analytical Query Processing in Apache Hive via CPU-GPU Heterogeneous Computing

Haotian Liu, Bo Tang, Jiashu Zhang, Yangshen Deng, Xiao Yan, Xinying Zheng, Qiaomu Shen, Dan Zeng, Zunyao Mao, Chaozu Zhang, Zhengxin You, Zhihao Wang, Runzhe Jiang, Fang Wang, Man Lung Yiu, Huan Li, Mingji Han, Qian Li, Zhenghai Luo

Proceedings of the 13th Symposium on Cloud Computing, 2022, pp. 158–172

SIGMOD 2022

GHive: A Demonstration of GPU-Accelerated Query Processing in Apache Hive

Haotian Liu, Bo Tang, Jiashu Zhang, Yangshen Deng, Xinying Zheng, Qiaomu Shen, Xiao Yan, Dan Zeng, Zunyao Mao, Chaozu Zhang, Zhengxin You, Zhihao Wang, Runzhe Jiang, Fang Wang, Man Lung Yiu, Huan Li, Mingji Han, Qian Li, Zhenghai Luo

Proceedings of the 2022 International Conference on Management of Data, 2022, pp. 2417–2420

Memories are the treasures in my life.