Apple interview question

What is a KV cache? How does it help in LLM inference?
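
As a starting point for an answer: in autoregressive decoding, each new token attends to the keys and values of every earlier token. A KV cache stores those keys and values so that each step only projects the newest token instead of re-projecting the whole prefix. The sketch below is a toy illustration, not any particular model's implementation. It assumes a single attention head, token embeddings passed in directly, and made-up projection matrices `W_Q`, `W_K`, `W_V`. The function names are hypothetical.

```python
import math

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def matvec(W, x):
    # W is a list of rows; returns the matrix-vector product W @ x.
    return [dot(row, x) for row in W]

def softmax(scores):
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def attend(q, keys, values):
    # Scaled dot-product attention of one query over all stored keys/values.
    scale = 1.0 / math.sqrt(len(q))
    weights = softmax([dot(q, k) * scale for k in keys])
    dim = len(values[0])
    return [sum(w * v[i] for w, v in zip(weights, values)) for i in range(dim)]

# Toy projection matrices (assumed values, for illustration only).
W_Q = [[1.0, 0.0], [0.0, 1.0]]
W_K = [[0.5, 0.1], [0.2, 0.3]]
W_V = [[0.3, 0.7], [0.6, 0.4]]

def decode_without_cache(tokens):
    """Recompute K and V for the whole prefix at every step."""
    outputs, projections = [], 0
    for t in range(len(tokens)):
        prefix = tokens[: t + 1]
        keys = [matvec(W_K, x) for x in prefix]
        values = [matvec(W_V, x) for x in prefix]
        projections += 2 * len(prefix)  # one K and one V projection per prefix token
        q = matvec(W_Q, tokens[t])
        outputs.append(attend(q, keys, values))
    return outputs, projections

def decode_with_cache(tokens):
    """Project each token once and append its K and V to a growing cache."""
    keys, values = [], []  # the KV cache
    outputs, projections = [], 0
    for x in tokens:
        keys.append(matvec(W_K, x))
        values.append(matvec(W_V, x))
        projections += 2  # only the newest token is projected
        q = matvec(W_Q, x)
        outputs.append(attend(q, keys, values))
    return outputs, projections

tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [0.5, -0.5]]
out_plain, n_plain = decode_without_cache(tokens)
out_cached, n_cached = decode_with_cache(tokens)
```

Both functions produce the same attention outputs. The cached version does a constant number of key/value projections per step, while the uncached version does a number proportional to the prefix length. The attention itself still scans every cached entry at each step, and the cache's memory footprint grows linearly with sequence length, which is the main trade-off.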