Visual Encodings - Search News

EncQA: Benchmarking Vision-Language Models on Visual Encodings for Charts

Abstract: Multimodal vision-language models (VLMs) continue to achieve ever-improving scores on chart understanding benchmarks. Yet, we find that this progress does not fully capture the breadth of ...

GitHub

VAR: a new visual generation method elevates GPT-style models beyond diffusion & Scaling laws observed

🕹️ Try and Play with VAR! We provide a demo website for you to play with VAR models and generate images interactively. Enjoy the fun of visual autoregressive modeling! We provide a demo website for ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

EncQA: Benchmarking Vision-Language Models on Visual Encodings for Charts

VAR: a new visual generation method elevates GPT-style models beyond diffusion & Scaling laws observed

Trending now