Domanda di colloquio di Appier

Why vanishing and exploding gradient happens in RNN? What is CRF? How do you learn embeddings for users based on their visits on websites?