
Ramen: Robust Test-Time Adaptation of Vision-Language Models with Active Sample Selection
Researchers introduce Ramen, a test-time adaptation framework that improves vision-language models like CLIP when facing mixed-domain data shifts. The method uses active sample selection to retrieve relevant batches for each test sample, addressing a practical gap where existing approaches assume single-domain test distributions.52




























