You might not need a different model, but better settings ...
Megatron-Bridge (v0.5.0), released by NVIDIA, is a library that converts Megatron-format models to Hugging Face, lowering the barrier to model migration by supporting over 15 models. At the same time, ...