FlexAttention for efficient high-resolution vision-language models

Published in ECCV, 2024