model compression via distillation and quantization