Research2025-09-25
KV-cache compression extends context cheaply
New methods shrink the memory cost of long context, letting smaller GPUs handle bigger windows.
Source: arxiv.org
GPU and AI news from September 2025. Back to 2025 or the latest.
New methods shrink the memory cost of long context, letting smaller GPUs handle bigger windows.
Source: arxiv.org
An open image model with strong text rendering and edit-by-instruction control.
Source: github.com