Hugging Face Quantization Tutorial

MiniMax-M3-Tutorial.md

This tutorial demonstrates how to run MiniMax-M3 model inference using SGLang integrated with KT-Kernel for CPU-GPU heterogeneous inference. This setup enables efficient deployment of M3's ...

note

[May-June 2026] Local LLM Observation Log: Two Months of Specialized 4B Medical Models and ...

This series is a fixed-point observation record of local LLMs for hospital pharmacists. Every month, I collect information from multiple AIs using the same research prompt, adding primary source ...

GitHub

louisfb01/start-ai-engineering

4️⃣ Hugging Face - Official tutorials across the open-source AI ecosystem. Covers fine-tuning, inference, datasets, and new model releases. 4️⃣ Prompt Engineering - Practical prompt engineering and AI ...
