Comet: Accelerating Private Inference for Large Language Model by Predicting Activation Sparsity

Publication
IEEE Symposium on Security and Privacy (S&P), Accepted
Date