“LLM decoding is bottlenecked for large batches and long contexts by loading the key-value (KV) cache from high-bandwidth memory, which inflates per-token latency, while the sequential nature of ...
Joint source-channel coding/decoding (JSCC/JSCD) techniques have become state-of-the-art and one of the challenging research subjects in the spatial communication area. This paper addresses the basic ...