Domanda di colloquio di Zinga!

What is gradient decent?? Explain Transformers in NLP?