Neural Magic interview question

Speeding up an existing CUDA kernel and proposing some optimizations.