Tags

SLK
Cross-Modal Benchmark
Image Caption
Image-Text Retrieval
Image-Text Matching
Vision-Language Model
AlphaZero