SOFTWAREHugging Face
Multimodal Generation
Any-to-any multimodal generation in Transformers
Hugging Face Transformers task guide and pipeline for any-to-any multimodal generation, supporting combinations of text, image, audio, and video inputs and outputs.
Recent stories
0 linked stories
No linked stories yet.