PIXAR: Auto-Regressive Language Modeling in Pixel Space
Published in ACL Findings, 2024
We present PIXAR, the first pixel-based autoregressive LLM that understands and generates text-in-images, reaching GPT-2–level performance without relying on symbolic tokenization.