Tags

PKConv
SLK
Cross-Modal Benchmark
Image Caption
Image-Text Retrieval
Imate-Text Matching
Vision-Language Model