📝 Text, for tasks like text classification, information extraction, question answering, summarization, translation, and text generation, in over 100 languages. 🖼️ Images, for tasks like image ...
The model supports both text and audio outputs, if users do not need audio outputs, they can set enable_audio_output in the from_pretrained function. This option will save about ~2GB of GPU memory but ...
一些您可能无法访问的结果已被隐去。
显示无法访问的结果