I applied through a recruiter. I interviewed at Birlasoft.
Interview
Basic data warehouse concepts, PySpark internals, and SQL questions: joins, 5th highest salary, window functions, Hive internal and external tables, dense rank vs. rank, group by vs. reduce by, Hadoop and HDFS.
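For the "5th highest salary" and rank vs. dense rank questions, here is a minimal pure-Python sketch of the semantics that SQL's RANK() and DENSE_RANK() window functions usually have. The function names and the salary figures are made up for illustration; they are not from the interview.

```python
def rank_desc(values):
    """RANK()-style: ties share a rank, and the next rank skips ahead."""
    ranks = {}
    for position, value in enumerate(sorted(values, reverse=True), start=1):
        ranks.setdefault(value, position)  # keep the first (best) position
    return ranks


def dense_rank_desc(values):
    """DENSE_RANK()-style: ties share a rank, with no gaps afterwards."""
    distinct = sorted(set(values), reverse=True)
    return {value: i + 1 for i, value in enumerate(distinct)}


def nth_highest(values, n):
    """n-th highest distinct value, or None if fewer than n exist."""
    distinct = sorted(set(values), reverse=True)
    return distinct[n - 1] if 1 <= n <= len(distinct) else None


# Hypothetical sample data with two ties.
salaries = [900, 800, 800, 700, 600, 500, 500, 400]

print(rank_desc(salaries)[700])        # 4 (rank 3 is skipped after the tie)
print(dense_rank_desc(salaries)[700])  # 3
print(nth_highest(salaries, 5))        # 500
```

With ties in the data, "5th highest salary" is normally read as the 5th distinct value, which is why dense rank is the usual choice for that question.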
What you’re describing sounds like an interviewer who was checking boxes rather than evaluating thinking, which is a weak interviewing style. When someone just fires questions and looks for their expected keywords, they’re not really assessing your real capability, especially for a role like data engineering where reasoning matters more than memorized answers.
Interview questions [1]
Question 1
SCD, ADF, PySpark, Delta format, writes and schema evolution
I applied through a recruiter. The process took one day. I interviewed at Birlasoft (Hyderabad) in March 2021.
Interview
I received a call from the recruiter and the interview was scheduled for the very next day. It was a video call with someone from the team I was interviewing for. The interview was divided into three parts: (1) Python-related questions, (2) ETL/data warehouse-related questions, and (3) AWS-related questions.
Interview questions [2]
Question 1
PYTHON
------------
1) What is the use of a class method?
2) What is garbage collection?
3) What are the data structures in Python?
4) What is the difference between a Boto resource and a Boto client object?
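As one possible illustration for the class method question, the sketch below uses `@classmethod` as an alternative constructor. The `S3Path` and `VersionedS3Path` classes and the example URI are hypothetical and exist only for this example.

```python
class S3Path:
    """Hypothetical helper class, used only for illustration."""

    def __init__(self, bucket, key):
        self.bucket = bucket
        self.key = key

    @classmethod
    def from_uri(cls, uri):
        """Alternative constructor: receives the class (cls), not an instance."""
        prefix = "s3://"
        if not uri.startswith(prefix):
            raise ValueError(f"not an S3 URI: {uri!r}")
        bucket, _, key = uri[len(prefix):].partition("/")
        return cls(bucket, key)  # builds whichever class it was called on


class VersionedS3Path(S3Path):
    """Subclass that inherits from_uri unchanged."""


path = S3Path.from_uri("s3://my-bucket/data/file.csv")
versioned = VersionedS3Path.from_uri("s3://my-bucket/data/file.csv")

print(path.bucket, path.key)     # my-bucket data/file.csv
print(type(versioned).__name__)  # VersionedS3Path
```

Because `cls` is bound to the class the method is called on, the subclass gets instances of its own type without overriding anything, which a static method or a plain function could not do as directly.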
ETL/DWH
-------------
1) What is the difference between a Data Lake and a Data Warehouse?
2) Can Glue pull data from multiple sources (S3, CSV files, etc.)?
3) Is Teradata an RDBMS or a NoSQL DB?
4) What is the difference between an RDBMS and a NoSQL DB?
5) Which DB is better for storing redundant data?
6) What is the difference between a denormalized RDBMS and NoSQL for storing redundant data?
AWS
------
1) Why is AWS Glue used? Is it used to create a Data Lake or a Data Warehouse?
2) What is the cold start procedure in Lambda?
3) What is the maximum execution time of Lambda?
4) How do you execute a 40-minute job using Lambda?
5) How do you compress an image file using Lambda?
6) Assume an S3 bucket hosted in another account is private and I need to access a file path in
that S3 bucket from another AWS account
7) Vertical vs. horizontal scaling in AWS
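On questions 3 and 4: Lambda's maximum timeout is 900 seconds (15 minutes), so a 40-minute job cannot run in a single invocation. One common approach is to split the work into resumable chunks and have an orchestrator (AWS Step Functions, for example) re-invoke the function with a cursor; another is to move the job to a service without that cap. The sketch below only simulates the chunking idea locally. `handler`, `run_job`, the event shape, and the doubling "work" are all assumptions for illustration, not an AWS API.

```python
LAMBDA_MAX_SECONDS = 900  # Lambda's maximum configurable timeout (15 minutes)


def invocations_needed(job_seconds, budget_seconds=LAMBDA_MAX_SECONDS):
    """Ceiling division: how many invocations a job of this length needs."""
    return -(-job_seconds // budget_seconds)


def handler(event, work_items, chunk_size):
    """Process one chunk starting at event['cursor'] and report progress."""
    start = event.get("cursor", 0)
    end = min(start + chunk_size, len(work_items))
    processed = [item * 2 for item in work_items[start:end]]  # stand-in work
    return {"cursor": end, "done": end >= len(work_items), "processed": processed}


def run_job(work_items, chunk_size):
    """Simulated orchestrator: re-invoke the handler until it reports done."""
    if chunk_size < 1:
        raise ValueError("chunk_size must be at least 1")
    event, results, invocations = {"cursor": 0}, [], 0
    while True:
        out = handler(event, work_items, chunk_size)
        invocations += 1
        results.extend(out["processed"])
        if out["done"]:
            return results, invocations
        event = {"cursor": out["cursor"]}  # checkpoint passed to the next call


print(invocations_needed(40 * 60))  # 3
results, calls = run_job(list(range(10)), chunk_size=4)
print(calls)                        # 3
```

In a real function the chunk boundary would more likely be driven by the remaining time reported by the Lambda context object than by a fixed item count.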