For anyone interested or working on multi-modality learning (video, image, audio, language...)
46 people · by @roymiles.bsky.social