Comet: Accelerating Private Inference for Large Language Model by Predicting Activation Sparsity

Publication
IEEE Symposium on Security and Privacy (S&P)
Date