Publications
Conference
FLoRIST: Singular Value Thresholding for Efficient and Accurate Federated Fine-Tuning of Large Language Models
Annual Conference on Machine Learning and Systems (MLSys), 2026
Acceptance rate: 26.7%
ViTALiTy: Unifying Low-rank and Sparse Approximation for Vision Transformer Acceleration with a Linear Taylor Attention
IEEE International Symposium on High-Performance Computer Architecture (HPCA), 2023
Acceptance rate: 25%
Householder Sketch for Accurate and Accelerated Least-Mean-Squares Solvers
International Conference on Machine Learning (ICML), 2021
Acceptance rate: 21.47%
FPGA-based Distributed Edge Training of SVM
ACM/SIGDA International Symposium on Field Programmable Gate Arrays (FPGA), 2019
ConvLight: A Convolutional Accelerator with Memristor Integrated Photonic Computing
IEEE International Conference on High Performance Computing (HiPC), 2017
Acceptance rate: 23%
Distributed QR Decomposition Framework for Training Support Vector Machines
IEEE International Conference on Distributed Computing Systems (ICDCS), 2017
Acceptance rate: 16.9%
A Relaxed Synchronization Approach for Solving Parallel Quadratic Programming Problems with Guaranteed Convergence
IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2016
Acceptance rate: 23%
A Density Based Method for Automatic Hairstyle Discovery and Recognition
National Conference on Computer Vision, Pattern Recognition, Image Processing and Graphics (NCVPRIPG), 2013
Journal
NetDistiller: Empowering Tiny Deep Learning via In-Situ Distillation
IEEE Micro, 2023 — Special Issue on tinyML
Impact factor: 3.6
Distributed Training of Support Vector Machine on a Multiple-FPGA System
IEEE Transactions on Computers (TC), 2020 — Special Issue on Machine-Learning Architectures and Accelerators
Acceptance rate: 21%
Impact factor: 3.131
Fast and Communication-Efficient Algorithm for Distributed Support Vector Machine Training
IEEE Transactions on Parallel and Distributed Systems (TPDS), 2018
Impact factor: 3.402