Welcome to the DeepWok Lab
The DeepWok Lab is an ML research group led by Dr. Aaron Zhao, with members mainly from Imperial College London and the University of Cambridge.
Members
- Aaron Zhao (Faculty, PI)
- Cheng Zhang (PhD Student, co-supervised with Prof. George Constantinides)
- Victor Zhao (PhD Student, co-supervised with Prof. Pietro Lio)
- Zehui Li (PhD Student, co-supervised with Prof. Guy-Bart Stan)
- Mingzhu Shen (PhD Student, co-supervised with Prof. Christos Bouganis)
- Pedro Gimenes (PhD Student, co-supervised with Prof. George Constantinides)
- Timon Schneider (PhD Student, co-supervised with Prof. Tom Ellis, and Prof. Guy-Bart Stan)
- Eleanor Clifford (PhD Student, co-supervised with Prof. Ross Anderson and Prof. Robert Mullins)
- Can Xiao (PhD Student, co-supervised with Dr. Jianyi Cheng)
- Jeffrey Tsz Hang Wong (PhD Student, co-supervised with Prof. Wayne Luk)
- Hanna Foerster (PhD Student, co-supervised with Prof. Robert Mullins)
Student Projects
Each year, we supervise a number of undergraduate and master's projects at Imperial College London and the University of Cambridge. We also host a large number of summer research internships.
We have finished hiring for summer 2024; please only contact me about 2025 MEng/MPhil/MSc/Part II/Part III/Intern projects.
Instead of listing individual projects here, I find it easier to describe the research topics we are interested in, so that prospective students can reach out to us. Here are some of the topics and larger projects we are continuously working on:
- Project MASE: MASE aims to provide a unified representation for software-defined ML heterogeneous system exploration. We are looking for students interested in working on this project, ideally with a background and interest in ML systems, efficient ML, and ML hardware acceleration.
- Beyond Structured Data: We are interested in projects that involve unstructured and multimodal data, such as graphs, hypergraphs, and combinatorial complexes. We envision these data types as the enabler for the next generation of AI systems that go beyond simple images and text, and we are looking for students who are interested in working on these projects.
- Efficient AI: We are interested in different aspects of efficient AI, including efficient training, efficient inference, efficient model search, and efficient model deployment with state-of-the-art GenAI models (e.g., language and diffusion models).
- System-level AI Safety: With the increasing capability of GenAI models and the growing complexity of AI systems, we are interested in projects that focus on system-level AI safety, including robustness, security, and red-teaming these models to uncover new vulnerabilities.
I am also happy to host self-proposed projects if they match the Lab's research interests. Feel free to contact a.zhao@imperial.ac.uk if you would like to do a project with us!
Past and Current Students
Academic Year 2023/2024
- Henry Li (Summer Research Intern, University of Cambridge)
- Ben Zhang (Summer Research Intern, University of Cambridge)
- Harry Langford (Summer Research Intern, University of Cambridge)
- Sanjit Raman (Summer Research/Teaching Intern, Imperial College London)
- Kevin Lau (Summer Research/Teaching Intern, Imperial College London)
- Xiandong Zou (Summer Research Intern, Imperial College London)
- Roshan Aekote (Summer Research Intern, Imperial College London)
- Li Wang (MSc project, Imperial College London)
- Charles Jin (MSc project, Imperial College London)
- Yichen Li (MSc project, Imperial College London)
- Przemyslaw Forys (MSc project, Imperial College London)
- Yuhe Zhang (Final Year Project, Imperial College London)
- Bryan Tan (Final Year Project, Imperial College London)
- Balint Szekely (Final Year Project, Imperial College London)
- Derek Lai (Final Year Project, Imperial College London)
- Bakhtiar Mohammadzadeh (Final Year Project, Imperial College London)
- TszHang Wong (Final Year Project, Imperial College London)
- Ben Zhang (Part II Project, University of Cambridge)
- Bradley Chen (Part II Project, University of Cambridge)
- Kate Liang (Part II Project, University of Cambridge)
Academic Year 2022/2023
- David Gyulamiryan (Summer Research Intern, University of Cambridge)
- Eduard Burlacu (Summer Research Intern, University of Cambridge)
- Harry Langford (Summer Research Intern, University of Cambridge)
- Ben Zhang (Summer Research Intern, University of Cambridge)
- Leah He (Summer Research Intern, University of Cambridge)
- Junyi Wu (Summer Research Intern, Imperial College London)
- Harry Ni (Summer Research Intern, Imperial College London)
- Xiandong Zou (Summer Research Intern, Imperial College London)
- Anthony Bolton (Summer Research Intern, Imperial College London)
- Aaron Thomas (Summer Research Intern, Imperial College London)
- Sudarshan Sreeram (Summer Research Intern, Imperial College London)
- Diego Van Overberghe (Summer Research Intern, Imperial College London)
- Bryan Tan (Summer Research Intern, Imperial College London)
- TszHang Wong (Summer Research Intern, Imperial College London)
- Aman Vernekar (Summer Research Intern, University of Cambridge)
- Haoliang Shang (BEng Project, Imperial College London / ETH Zurich)
- Jacky Choi (BEng Project, Imperial College London / ETH Zurich)
- Can Xiao (MSc Project, Imperial College London)
- Sheng Luo (MSc Project, Imperial College London)
- Chuiyu Wang (MSc Project, Imperial College London)
- Pedro Gimenes (Final Year Project, Imperial College London)
- Nickolaos Ilioudis (Final Year Project, Imperial College London)
- Issa Bqain (Final Year Project, Imperial College London)
- Tobias Cook (Final Year Project, Imperial College London)
- Peter Barabas (Final Year Project, Imperial College London)
- Ritvik Shyam (Final Year Project, Imperial College London)
- Harry Knighton (Part II project, University of Cambridge)
- Fredrik Ekholm (Part II project, University of Cambridge)
- Thomas Yuan (Part II project, University of Cambridge)
- Kyra Zhou (Part II project, University of Cambridge)
Academic Year 2021/2022
- Eleanor Clifford (Summer Research Intern, from University of Cambridge)
- Joseph Rance (Summer Research Intern, from University of Cambridge)
- Victor Zhao (Summer Research Intern, from University of Cambridge)
- Skye Purchase (Summer Research Intern, from University of Cambridge)
- Cindy Wu (Summer Research Intern, from University of Cambridge)
- Guo Yang (Summer Research Intern, from University of Cambridge)
- Prisha Satwani (Summer Research Intern, from LSE)
- Jason Brown (Summer Research Intern, from University of Cambridge)
Publications
Year 2024
MD-DiT: Step-aware Mixture-of-Depths for Efficient Diffusion Transformers; M Shen, P Chen, P Ye, G Xia, T Chen, C Bouganis, Y Zhao; NeurIPS 2024 Workshop on Adaptive Foundation Models (NeurIPS 2024, AFM Workshop)
Architectural Neural Backdoors from First Principles; H Langford, I Shumailov, Y Zhao, R Mullins, N Papernot; IEEE Symposium on Security and Privacy 2024 (S&P 2024)
GV-Rep: A Large-Scale Dataset for Genetic Variant Representation Learning; Z Li, V Subasi, G Stan, Y Zhao, B Wang; The Thirty-eighth Conference on Neural Information Processing Systems Datasets and Benchmarks Track (NeurIPS 2024, Datasets and Benchmarks Track)
Absorb & Escape: Overcoming Single Model Limitations in Generating Heterogeneous Genomic Sequences; Z Li, Y Ni, G Xia, W Beardall, A Das, G Stan, Y Zhao; The Thirty-eighth Conference on Neural Information Processing Systems (NeurIPS 2024)
AI models collapse when trained on recursively generated data; I Shumailov, Z Shumaylov, Y Zhao, N Papernot, R Anderson, Y Gal; Nature 631 (Front Cover)
Enhancing Node Representations for Real-World Complex Networks with Topological Augmentation; X Zhao, Z Li, M Shen, G Stan, P Lio, Y Zhao; European Conference on Artificial Intelligence (ECAI 2024)
Unlocking the Global Synergies in Low-Rank Adapters; Z Zhang, C Zhang, X Gao, R Mullins, G Constantinides, Y Zhao; (ICML 2024 Workshop ES-FoMo-II)
Optimised Grouped-Query Attention Mechanism for Transformers; Y Chen, C Zhang, X Gao, R Mullins, G Constantinides, Y Zhao; (ICML 2024 Workshop ES-FoMo-II)
HASS: Hardware-Aware Sparsity Search for Dataflow DNN Accelerators; Z Yu, S Sreeram, K Agrawal, J Wu, A Montgomerie-Corcoran, C Zhang, J Cheng, C Bouganis, Y Zhao; The International Conference on Field-Programmable Logic and Applications (FPL 2024)
LQER: Low-Rank Quantization Error Reconstruction for LLMs; C Zhang, J Cheng, G Constantinides, Y Zhao; International Conference on Machine Learning (ICML 2024)
ImpNet: Imperceptible and Blackbox-undetectable Backdoors in Compiled Neural Networks; E Clifford, I Shumailov, Y Zhao, R Anderson, R Mullins; 2nd IEEE Conference on Secure and Trustworthy Machine Learning (SaTML 2024)
Year 2023
Will More Expressive Graph Neural Networks do Better on Generative Tasks?; X Zou, X Zhao, P Lio, Y Zhao; The Second Learning on Graphs Conference (LOG 2023)
Latent Diffusion Model for DNA Sequence Generation; Z Li, Y Ni, T Huygelen, A Das, G Xia, G Stan, Y Zhao; Conference on Neural Information Processing Systems, AI for Science Workshop (NeurIPS 2023, AI for Science Workshop)
MASE: An Efficient Representation for Software-Defined ML Hardware System Exploration; C Zhang, J Cheng, Z Yu, Y Zhao; Conference on Neural Information Processing Systems, Machine Learning for Systems Workshop (NeurIPS 2023, ML for Systems Workshop)
Dynamic Stashing Quantization for Efficient Transformer Training; G Yang, D Lo, R Mullins, Y Zhao; The 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023, findings)
Revisiting Block-based Quantisation: What is Important for Sub-8-bit LLM Inference?; C Zhang, J Cheng, I Shumailov, G Constantinides, Y Zhao; The 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023)
MiliPoint: A Point Cloud Dataset for mmWave Radar; H Cui, S Zhong, J Wu, Z Shen, N Dahnoun, Y Zhao; Conference on Neural Information Processing Systems (NeurIPS 2023, Datasets and Benchmarks Track)
Revisiting Structured Dropout; Y Zhao, O Dada, X Gao, RD Mullins; The 15th Asian Conference on Machine Learning (ACML 2023)
Genomic Interpreter: A Hierarchical Genomic Deep Neural Network with 1D Shifted Window Transformer; Z Li, A Das, WAV Beardall, Y Zhao, GB Stan; The 2023 ICML Workshop on Computational Biology (ICML-WCB 2023, contributed talk, best paper award)
Revisiting Automated Prompting: Are We Actually Doing Better?; Y Zhou, Y Zhao, I Shumailov, R Mullins, Y Gal; Association for Computational Linguistics 2023 (ACL 2023)
Task-Agnostic Graph Neural Network Evaluation via Adversarial Collaboration; X Zhao, H Stärk, D Beaini, P Liò, Y Zhao; ICLR 2023 - Machine Learning for Drug Discovery workshop (ICLR 2023 MLDD workshop)
Augmentation Backdoors; J Rance, Y Zhao, I Shumailov, R Mullins; ICLR 2023 Workshop on Backdoor Attacks and Defenses in Machine Learning (ICLR 2023 BANDS Workshop)
Adaptive Channel Sparsity for Federated Learning under System Heterogeneity; X Gao, D Liao, Y Zhao, C Xu; The IEEE / CVF Computer Vision and Pattern Recognition Conference (CVPR 2023)
Architectural Backdoors in Neural Networks; M Bober-Irizar, I Shumailov, Y Zhao, R Mullins, N Papernot; The IEEE / CVF Computer Vision and Pattern Recognition Conference (CVPR 2023)
Year 2022
Revisiting Embeddings for Graph Neural Networks; S Purchase, Y Zhao, R Mullins; The First Learning on Graphs Conference (LOG 2022)
Wide Attention Is The Way Forward For Transformers; J R Brown, Y Zhao, I Shumailov, R Mullins; All Things Attention: Bridging Different Perspectives on Attention, Oral, (NeurIPS 2022 Workshop)
DARTFormer: Finding The Best Type Of Attention; J R Brown, Y Zhao, I Shumailov, R Mullins; ICBINB, (NeurIPS 2022 Workshop)
Rapid Model Architecture Adaption for Meta-Learning; Y Zhao, X Gao, I Shumailov, N Fusi, R Mullins; Advances in Neural Information Processing Systems 35 (NeurIPS 2022)
DAdaQuant: Doubly-adaptive quantization for communication-efficient Federated Learning; R Hönig, Y Zhao, R Mullins; International Conference on Machine Learning (ICML 2022)