On Compressing The Embedding Matrix Of Language Models For Edge Deployment

Vasileios Lioutas, Ahmad Rashid, and Krtin Kumar

Edge Intelligence Workshop