Return to Article Details Sustainability-Driven Neural Network Compression For Efficient Large-Scale Model Serving Download Download PDF