Removed 10 training images from v1 and captioned the remaining 30 (simple short captions by hand, not BLIPped or whatever).
[PUBLISHEDTOCIVITAIONLY]
Removed 10 training images from v1 and captioned the remaining 30 (simple short captions by hand, not BLIPped or whatever).
[PUBLISHEDTOCIVITAIONLY]