An open-source multimodal GUI agent from ByteDance for controlling computer interfaces and completing tasks through visual and language understanding.

