you need Multi-GPU support (not implemented) Native 8-bit/4-bit quantization (not implemented)
you need
Multi-GPU support (not implemented)
Native 8-bit/4-bit quantization (not implemented)