Data Engineer Interview Questions
- 0 of 0 votes
What are parameters, and what types of parameters are there? How does linking parameters with datasets work? How do you create parameters for data sources? How do you create parameters for file paths?
- Ashish Roe August 31, 2023 in United States
Accenture Data Engineer Algorithm - 0 of 0 votes
Employee  Salary  Department
A         1000    IT
B         2000    IT
C         3000    IT
D         4000    HR
E         2000    HR
F         1500    IT
G         7000    HR
Write a query to get results like below:
Employee  Salary  Next highest salary (in same department)  Department
A         1000    2000                                      IT
B         2000    3000                                      IT
.
.
E         2000    4000                                      HR
- 11gupt October 10, 2020 in United States
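One way to approach this, sketched with Python's built-in `sqlite3` module so the query can actually be run. The table name `emp` and the column names are assumptions, since the question does not name them. The correlated subquery picks the smallest salary in the same department that is strictly greater than the current row's salary. Note that under this reading, with F at 1500 in IT, the next highest salary for A comes out as 1500 rather than the 2000 shown in the sample output.

```python
import sqlite3

# Table and column names (emp, employee, salary, department) are assumed for illustration.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE emp (employee TEXT, salary INTEGER, department TEXT)")
conn.executemany(
    "INSERT INTO emp VALUES (?, ?, ?)",
    [("A", 1000, "IT"), ("B", 2000, "IT"), ("C", 3000, "IT"), ("D", 4000, "HR"),
     ("E", 2000, "HR"), ("F", 1500, "IT"), ("G", 7000, "HR")],
)

# For each employee, find the smallest strictly greater salary in the same department.
QUERY = """
SELECT e.employee,
       e.salary,
       (SELECT MIN(x.salary)
          FROM emp AS x
         WHERE x.department = e.department
           AND x.salary > e.salary) AS next_highest_salary,
       e.department
  FROM emp AS e
 ORDER BY e.department, e.salary
"""
rows = conn.execute(QUERY).fetchall()
```

The top earner in each department gets NULL (`None` in Python) because there is no higher salary to report. In a dialect with window functions, `LEAD(salary) OVER (PARTITION BY department ORDER BY salary)` is a common alternative.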
Data Engineer - 0 of 0 votes
House No  Members
1100      John, mary, kim, ash
1101      Dan, Roger, kee
Write a query to get the data in the format below:
House No  Person
1100      john
1100      mary
1100      kim
1100      ash
1101      Dan
1101      roger
1101      kee
- 11gupt October 10, 2020 in United States
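The question asks for a query, and the SQL for splitting a delimited column into rows depends heavily on the dialect. As a dialect-neutral illustration, here is the same row-splitting logic as a small Python sketch. The function name `explode_members` is hypothetical, and names are kept with their original casing.

```python
def explode_members(rows):
    """Turn (house_no, 'a, b, c') rows into one (house_no, person) row per member."""
    result = []
    for house_no, members in rows:
        for name in members.split(","):
            name = name.strip()
            if name:  # skip empty entries caused by stray commas
                result.append((house_no, name))
    return result


houses = [(1100, "John, mary, kim, ash"), (1101, "Dan, Roger, kee")]
exploded = explode_members(houses)
```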
Data Engineer SQL - 0 of 0 votes
If (a = True) {
    If (b = True) {
        Approve credit card;
    }
    Else {
        Disapprove credit card;
    }
}
Else {
    If (C = False) {
        If (D = True) {
            Approve credit card;
        }
        Else {
            Disapprove credit card;
        }
    }
}
Simplify the code written above.
- 11gupt October 10, 2020 in United States
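A possible simplification, sketched in Python and assuming `=` in the pseudocode is meant as a comparison. One detail worth noticing is that when a is false and C is true the original takes no action at all, so the simplified version has to preserve that third outcome. The function names and the string return values are illustrative only. The last line checks both versions against each other for all 16 input combinations.

```python
from itertools import product


def original(a, b, c, d):
    """Literal translation of the nested branches; None means no action is taken."""
    if a:
        if b:
            return "approve"
        else:
            return "disapprove"
    else:
        if not c:
            if d:
                return "approve"
            else:
                return "disapprove"
    return None


def simplified(a, b, c, d):
    """Same behaviour with the nesting flattened."""
    if not a and c:
        return None  # the original falls through without a decision here
    deciding_flag = b if a else d
    return "approve" if deciding_flag else "disapprove"


# Exhaustively compare the two versions over every combination of inputs.
all_match = all(
    original(*values) == simplified(*values)
    for values in product([False, True], repeat=4)
)
```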
Data Engineer Java - 0 of 0 votes
While designing a table in a database, how will you determine its performance?
- 11gupt October 10, 2020 in United States
Data Engineer SQL - 0 of 0 votes
Implement union and intersection of two arrays (in an efficient way).
Additional information given by the interviewer: elements of the two given arrays may be repeated, but must not be repeated in the union and intersection arrays.
- dhruvjoshi43 October 04, 2019 in india for none
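A minimal sketch of one efficient approach, using hash sets so that each element is handled in expected constant time. The function name is hypothetical, and preserving first-seen order in the output is a choice made here, not something the question requires.

```python
def union_and_intersection(first, second):
    """Return (union, intersection) as duplicate-free lists in first-seen order."""
    in_first = set(first)
    in_second = set(second)
    union = []
    emitted = set()
    for value in list(first) + list(second):
        if value not in emitted:  # emit each distinct value only once
            emitted.add(value)
            union.append(value)
    intersection = [v for v in union if v in in_first and v in in_second]
    return union, intersection


union, intersection = union_and_intersection([1, 2, 2, 3], [2, 3, 3, 4])
```

This runs in expected O(n + m) time for arrays of length n and m. If the arrays are already sorted, a two-pointer merge achieves the same result without extra hash sets.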
Amazon Data Engineer test - 0 of 0 votes
/*
## Setup
The flow of a dispute is as follows:
- A charge is created by an end customer.
- Stripe receives a dispute record from the bank.
- The business responds with evidence.
- If no second dispute is received within 30 days after evidence submission, the dispute is won. If a second dispute is received, the dispute is lost.
Charge
(Maybe) Dispute Record
(Maybe) Evidence submission
(Maybe) Second Dispute Record
The raw tables generated from the API look like:
```
Charges
+---------------+-----------+
| charge_id     | varchar   |
| created       | timestamp |
| amount        | int       |
| seller_id     | varchar   |
| customer_id   | varchar   |
+---------------+-----------+
Dispute Records
+---------------+-----------+
| dispute_id    | varchar   |
| created       | timestamp |
| charge_id     | varchar   |
+---------------+-----------+
Evidence Submission
+---------------+-----------+
| evidence_id   | varchar   |
| created       | timestamp |
| charge_id     | varchar   |
+---------------+-----------+
```
*/
/*
1. Can you design a unified dispute table that would allow us to compute things like the win rate, dispute rate, evidence submission rate, etc.?
*/
- trish March 19, 2019 in United States
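One possible shape for the unified table is one row per charge, carrying the timestamps of the first dispute, the evidence submission, and the second dispute, plus a derived status. The sketch below builds such rows in Python. Several things are assumptions not stated in the question: the status labels, the `as_of` cut-off used to decide whether the 30-day window has elapsed, treating any second dispute as a loss regardless of timing, and the `no_evidence` status for disputes that were never answered.

```python
from datetime import datetime, timedelta

WIN_WINDOW = timedelta(days=30)


def build_dispute_rows(charges, disputes, evidence, as_of):
    """Return one dict per charge with dispute timestamps and a derived status.

    charges:  list of (charge_id, created)
    disputes: list of (dispute_id, created, charge_id)
    evidence: list of (evidence_id, created, charge_id)
    """
    dispute_times = {}
    for _, created, charge_id in disputes:
        dispute_times.setdefault(charge_id, []).append(created)
    evidence_times = {}
    for _, created, charge_id in evidence:
        evidence_times.setdefault(charge_id, []).append(created)

    rows = []
    for charge_id, charge_created in charges:
        d_times = sorted(dispute_times.get(charge_id, []))
        e_times = sorted(evidence_times.get(charge_id, []))
        first_dispute = d_times[0] if d_times else None
        second_dispute = d_times[1] if len(d_times) > 1 else None
        evidence_at = e_times[0] if e_times else None

        if first_dispute is None:
            status = "not_disputed"
        elif second_dispute is not None:
            status = "lost"          # assumption: any second dispute is a loss
        elif evidence_at is None:
            status = "no_evidence"   # assumption: outcome not defined by the question
        elif as_of - evidence_at >= WIN_WINDOW:
            status = "won"
        else:
            status = "pending"       # still inside the 30-day window
        rows.append({
            "charge_id": charge_id,
            "charge_created": charge_created,
            "first_dispute_at": first_dispute,
            "evidence_at": evidence_at,
            "second_dispute_at": second_dispute,
            "status": status,
        })
    return rows
```

With rows like these, the dispute rate is the share of rows with a non-null `first_dispute_at`, the evidence submission rate is the share of disputed rows with a non-null `evidence_at`, and the win rate is the share of disputed rows whose status is `won`.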
Strip Data Engineer SQL - 0 of 0 votes
Create a custom feature transformer in Spark Scala. Let's say the DataFrame is like below:
+------------------------+
|              email_list|
+------------------------+
|  testmail1115@gmail.com|
|    mavenmaven@mlail.com|
|dnd.7899334622@gmail.com|
+------------------------+
If I use the transformer, it converts the input array of strings into an array of n-grams, like below:
+------------------------+--------------------+
|              email_list|              ngrams|
+------------------------+--------------------+
|  testmail1115@gmail.com|[t e, e s, s t, t...|
|    mavenmaven@mlail.com|[m a, a v, v e, e...|
|dnd.7899334622@gmail.com|     [d n, n d, d...|
+------------------------+--------------------+
How can I get the distinct n-grams present, rather than the pattern or array?
- ashwini.padhy89 December 03, 2018 in India
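The question is about Spark Scala, where a common route would be to explode the `ngrams` array column into rows and then take the distinct values. As a plain-Python illustration of that same idea (not Spark code), the sketch below builds character bigrams in the space-separated form shown in the table and collects the distinct ones across all emails. Both function names are hypothetical.

```python
def char_ngrams(text, n=2):
    """Character n-grams joined by spaces, e.g. 'test' -> ['t e', 'e s', 's t']."""
    return [" ".join(text[i:i + n]) for i in range(len(text) - n + 1)]


def distinct_ngrams(emails, n=2):
    """Flatten the per-email n-gram lists and keep each n-gram once, sorted."""
    seen = set()
    for email in emails:
        seen.update(char_ngrams(email, n))
    return sorted(seen)


emails = ["testmail1115@gmail.com", "mavenmaven@mlail.com", "dnd.7899334622@gmail.com"]
unique_bigrams = distinct_ngrams(emails)
```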
StartUp Data Engineer - 0 of 0 votes
You have two files in HDFS: one has date ranges with two columns, start date and end date, and the other has two columns, date and visitors. Write Spark code that uses the two files to find the date range with the maximum number of visitors.
- tokritijain October 30, 2018 in India
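The logic can be sketched without Spark first: sum the visitors whose date falls inside each range and keep the range with the largest total. The plain-Python version below assumes that ranges are inclusive at both ends and that "maximum" refers to the total visitor count over the range; the function name is hypothetical. In Spark the same idea would typically be a join on `date BETWEEN start AND end` followed by a group-by on the range.

```python
from datetime import date


def busiest_range(ranges, visits):
    """Return ((start, end), total_visitors) for the range with the most visitors.

    ranges: list of (start_date, end_date), assumed inclusive
    visits: list of (date, visitors)
    """
    visitors_by_date = {}
    for day, count in visits:
        visitors_by_date[day] = visitors_by_date.get(day, 0) + count

    best_range, best_total = None, -1
    for start, end in ranges:
        total = sum(c for d, c in visitors_by_date.items() if start <= d <= end)
        if total > best_total:
            best_range, best_total = (start, end), total
    return best_range, best_total


ranges = [(date(2018, 1, 1), date(2018, 1, 3)), (date(2018, 1, 4), date(2018, 1, 5))]
visits = [(date(2018, 1, 1), 10), (date(2018, 1, 2), 5),
          (date(2018, 1, 4), 20), (date(2018, 1, 5), 1)]
best = busiest_range(ranges, visits)
```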
Amazon Data Engineer - 1 of 1 vote
Given an array A, find the number of tuples (i, j, k, l) such that A[i] + A[j] + A[k] = A[l], where i < j < k < l.
- ajay.raj January 26, 2018 in United States
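One way to avoid the O(n^4) brute force is to rewrite the condition as A[i] + A[j] = A[l] - A[k] and keep a hash map of pair sums for all pairs that end before position k. The sketch below follows that idea and runs in O(n^2) time; the function name is hypothetical.

```python
from collections import Counter


def count_tuples(values):
    """Count index tuples i < j < k < l with values[i] + values[j] + values[k] == values[l]."""
    n = len(values)
    pair_sums = Counter()  # counts of values[i] + values[j] over all i < j < current k
    total = 0
    for k in range(n):
        # Every l to the right of k needs a pair (i, j) summing to values[l] - values[k].
        for l in range(k + 1, n):
            total += pair_sums[values[l] - values[k]]
        # Pairs ending at index k become usable once k moves one step to the right.
        for i in range(k):
            pair_sums[values[i] + values[k]] += 1
    return total
```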
Google Data Engineer - 0 of 0 votes
There are three numbers a, b, and c. The product of any two of the numbers is equal to the third number. For example, a*b=c or b*c=a or a*c=b. What are the possible values of a, b, and c?
- D PRAVEEN KUMAR January 23, 2018 in India
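Reading the condition as all three equations holding at once, multiplying them together gives (abc)^2 = abc, so abc is 0 or 1. If any number is 0 then all three are 0. Otherwise substituting bc = a into abc = 1 gives a^2 = 1, and likewise for b and c, so each number is 1 or -1 with an even number of -1 values. A small brute-force check over a few integers, as a sanity test of that reasoning over the reals:

```python
from itertools import product

# Search a small integer range for triples where each pairwise product equals the third number.
solutions = [
    (a, b, c)
    for a, b, c in product(range(-3, 4), repeat=3)
    if a * b == c and b * c == a and a * c == b
]
```

The search finds exactly five triples: (0, 0, 0), (1, 1, 1), and the three arrangements of one 1 with two -1 values.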
Skill Subsist Impulse Ltd Data Engineer General Questions and Comments - 0 of 0 votes
You are given a 2xN board and two kinds of tiles: 1x2 (two squares across) and 2x1 (two squares up). How many ways can you fill the board?
** ** * * * * ** **
Follow-up: four new kinds of tiles are added (an L shape in four different orientations), so there are now six kinds of tiles. How many ways can you fill the board now?
- ajay.raj January 23, 2018 in United States
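For the first part (dominoes only), the last column is covered either by one vertical tile or by two stacked horizontal tiles, which gives the recurrence f(n) = f(n-1) + f(n-2) with f(0) = f(1) = 1. A minimal sketch of that recurrence follows; it does not cover the L-shaped follow-up, and the function name is hypothetical.

```python
def count_domino_tilings(n):
    """Number of ways to tile a 2 x n board with 1x2 and 2x1 dominoes."""
    ways_prev, ways_cur = 1, 1  # f(0) = 1 (empty board), f(1) = 1
    for _ in range(2, n + 1):
        ways_prev, ways_cur = ways_cur, ways_prev + ways_cur
    return ways_cur
```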
Google Data Engineer - 0 of 0 votes
Given a binary matrix where 0 represents sea and 1 represents land, the value also represents the altitude. If a cell is land and all eight of its neighbors are also land, that cell can become 2; the elevations of a cell and each of its eight neighbors cannot differ by more than 1. Return the highest altitude that can be reached (special case: if the entire matrix is 1, the altitude is unlimited).
- ajay.raj January 23, 2018 in United States
Google Data Engineer - 0 of 0 votes
Given a weighted n-ary tree, find the longest path from the root node to a leaf node.
import java.util.Map;

class Node {
    int id;
    // connected node id -> edge weight
    Map<Integer, Integer> edges;
}
- ajay.raj January 21, 2018 in United States
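A short sketch of a depth-first traversal for this, written in Python rather than the Java of the snippet. It assumes the tree is available as a dictionary from node id to its `edges` map, that edges only point from parent to child, and that "longest" means the largest sum of edge weights. The function name is hypothetical.

```python
def longest_root_to_leaf(tree, root):
    """tree maps node id -> {child id: edge weight}; returns the largest root-to-leaf weight sum."""
    best = None
    stack = [(root, 0)]  # (node id, distance from root)
    while stack:
        node, dist = stack.pop()
        children = tree.get(node, {})
        if not children:  # a leaf: compare the accumulated distance
            best = dist if best is None else max(best, dist)
        for child, weight in children.items():
            stack.append((child, dist + weight))
    return best


tree = {1: {2: 3, 3: 1}, 2: {4: 2}, 3: {5: 10}, 4: {}, 5: {}}
longest = longest_root_to_leaf(tree, 1)
```

An explicit stack is used instead of recursion so that very deep trees do not hit Python's recursion limit.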
Google Data Engineer - 0 of 0 votes
Given a binary matrix, count the number of squares that can be formed entirely of 0s.
- ajay.raj January 20, 2018 in United States
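A common dynamic-programming approach: for each cell, compute the side length of the largest all-zero square whose bottom-right corner is that cell. That side length also equals the number of all-zero squares ending at the cell, so summing it over the matrix gives the count. A minimal sketch, assuming squares of every size (including 1x1) are counted; the function name is hypothetical.

```python
def count_zero_squares(matrix):
    """Count all square submatrices that contain only 0s."""
    if not matrix or not matrix[0]:
        return 0
    cols = len(matrix[0])
    prev = [0] * cols  # DP values for the previous row
    total = 0
    for r, row in enumerate(matrix):
        cur = [0] * cols
        for c, value in enumerate(row):
            if value == 0:
                if r == 0 or c == 0:
                    cur[c] = 1
                else:
                    # Limited by the squares ending above, to the left, and diagonally.
                    cur[c] = 1 + min(prev[c], cur[c - 1], prev[c - 1])
                total += cur[c]
        prev = cur
    return total
```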
Google Data Engineer - 0 of 0 votes
Given a string p, called the order, such as abc, which means a comes before b, and so on.
Given a second string s, determine whether it follows the order of p and return a boolean.
Examples:
aaa returns true
cba returns false
aaxyc returns true; letters that do not appear in the order are skipped
- ajay.raj January 20, 2018 in United States
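A minimal sketch of one interpretation: map each letter of the order to its position, then scan the second string and make sure the positions never decrease, ignoring letters that are not in the order. It assumes the order string has no repeated letters; the function name is hypothetical.

```python
def follows_order(order, text):
    """Return True if the letters of text that appear in order never go backwards."""
    rank = {ch: i for i, ch in enumerate(order)}
    last = -1
    for ch in text:
        if ch in rank:  # letters outside the order are skipped
            if rank[ch] < last:
                return False
            last = rank[ch]
    return True
```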
Google Data Engineer - 0 of 0 votes
MS FTE Question:
Find the gaps in 1,2,5,6,10
Answer: 3,4,7,8,9
- mktauseef October 04, 2017 in United States
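The question is tagged as a database question, so a SQL answer is probably expected, but the logic is easy to show in a few lines of Python: every integer between the minimum and maximum that is not in the input is a gap. The function name is hypothetical.

```python
def find_gaps(numbers):
    """Return the integers missing between min(numbers) and max(numbers)."""
    if not numbers:
        return []
    present = set(numbers)
    return [n for n in range(min(numbers), max(numbers) + 1) if n not in present]


gaps = find_gaps([1, 2, 5, 6, 10])
```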
Data Engineer Database - 0 of 0 votes
Design a system to find the top 10 Twitter hashtags in the most recent 1 min, 10 min, and 1 hr.
- AlgoBaba August 28, 2017 in United States
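A real design would involve distributed ingestion and aggregation, but the core windowing idea can be shown in a single process: keep one counter per minute, drop buckets older than the largest window, and answer a query by merging the buckets that fall inside the requested window. The sketch below is only that core idea. The class name and method names are hypothetical, and it assumes events arrive in timestamp order.

```python
from collections import Counter, deque


class HashtagWindow:
    """Per-minute hashtag counts, kept for at most max_minutes minutes."""

    def __init__(self, max_minutes=60):
        self.max_minutes = max_minutes
        self.buckets = deque()  # (minute, Counter) pairs, oldest first

    def add(self, hashtag, timestamp_s):
        minute = int(timestamp_s // 60)
        if not self.buckets or self.buckets[-1][0] != minute:
            self.buckets.append((minute, Counter()))
        self.buckets[-1][1][hashtag] += 1
        # Evict buckets that can no longer fall inside the largest window.
        while self.buckets and self.buckets[0][0] <= minute - self.max_minutes:
            self.buckets.popleft()

    def top(self, k, window_minutes, now_s):
        current = int(now_s // 60)
        totals = Counter()
        for minute, counts in self.buckets:
            if current - window_minutes < minute <= current:
                totals.update(counts)
        return totals.most_common(k)
```

Calling `top(10, 1, now)`, `top(10, 10, now)`, and `top(10, 60, now)` would then cover the three windows in the question. Because windows are aligned to whole minutes, results are approximate at the window edges.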
Twitter Data Engineer Software Design - 0 of 0 votes
number_one = "193283492420348904832902348908239048823480823"
number_two = "3248234890238902348823940990234"
Question:
1) Multiply these two numbers and return the answer.
2) DO NOT CONVERT TO INT AND DO THE MULTIPLICATION
- shopatlemo July 01, 2017 in United States
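A sketch of schoolbook long multiplication on the digit strings. It never converts either full number to an integer; only single characters are turned into digit values. The function name is hypothetical, and the inputs are assumed to be non-negative decimal strings without signs.

```python
def multiply_strings(x, y):
    """Multiply two non-negative decimal strings digit by digit."""
    if x == "0" or y == "0":
        return "0"
    # A product of an m-digit and an n-digit number has at most m + n digits.
    digits = [0] * (len(x) + len(y))
    for i in range(len(x) - 1, -1, -1):
        for j in range(len(y) - 1, -1, -1):
            digits[i + j + 1] += (ord(x[i]) - 48) * (ord(y[j]) - 48)
    # Propagate carries from the least significant position to the most significant.
    for pos in range(len(digits) - 1, 0, -1):
        digits[pos - 1] += digits[pos] // 10
        digits[pos] %= 10
    result = "".join(chr(d + 48) for d in digits).lstrip("0")
    return result or "0"


number_one = "193283492420348904832902348908239048823480823"
number_two = "3248234890238902348823940990234"
product_text = multiply_strings(number_one, number_two)
```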
Facebook Data Engineer Python - 0 of 0 votes
I have two tables.
Supplier Table:
Supp_id
supp_name
Invoice Table:
inv_id
supp_id
inv_date
inv_amt
payment_date
paid_amt
I want to list the invoice(s) that have the highest invoice amount (inv_amt) for the year 2016.
DO NOT USE the MIN/MAX functions.
- shopatlemo July 01, 2017 in United States
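One way to do this without MIN or MAX is an anti-join: keep the 2016 invoices for which no other 2016 invoice has a larger amount. The sketch below runs that query through Python's built-in `sqlite3` module. The table name `invoice`, the sample rows, and the assumption that `inv_date` is stored as ISO-8601 text are all made up for illustration.

```python
import sqlite3

# Sample rows are invented for illustration; column names follow the question.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE invoice (inv_id INTEGER, supp_id INTEGER, inv_date TEXT, "
    "inv_amt REAL, payment_date TEXT, paid_amt REAL)"
)
conn.executemany(
    "INSERT INTO invoice VALUES (?, ?, ?, ?, ?, ?)",
    [
        (1, 10, "2016-03-01", 500.0, None, None),
        (2, 11, "2016-07-15", 900.0, None, None),
        (3, 10, "2016-11-30", 900.0, None, None),
        (4, 12, "2017-01-05", 5000.0, None, None),
    ],
)

# Keep 2016 invoices for which no other 2016 invoice has a strictly larger amount.
QUERY = """
SELECT i.inv_id, i.inv_amt
  FROM invoice AS i
 WHERE i.inv_date >= '2016-01-01' AND i.inv_date < '2017-01-01'
   AND NOT EXISTS (
         SELECT 1
           FROM invoice AS j
          WHERE j.inv_date >= '2016-01-01' AND j.inv_date < '2017-01-01'
            AND j.inv_amt > i.inv_amt)
 ORDER BY i.inv_id
"""
top_invoices = conn.execute(QUERY).fetchall()
```

Unlike `ORDER BY inv_amt DESC LIMIT 1`, this form returns every invoice tied for the highest amount, which matches the "invoice(s)" wording.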
Facebook Data Engineer SQL - 4 of 4 votes
Questions related to GROUP BY with HAVING. The ER provided was a customer table.
- harshvp April 12, 2017 in United States for Search
Facebook Data Engineer Database - 0 of 0 votes
Find the percentage of male customers in a specific area out of all the customers in that area.
- harshvp April 12, 2017 in United States for Search
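A possible query, run here through Python's built-in `sqlite3` module. The question does not give the schema, so the table `customer` with columns `customer_id`, `gender`, and `area`, the gender code 'M', and the sample rows are all assumptions for illustration.

```python
import sqlite3

# Schema and rows are assumed; the question only mentions a customer table.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE customer (customer_id INTEGER, gender TEXT, area TEXT)")
conn.executemany(
    "INSERT INTO customer VALUES (?, ?, ?)",
    [(1, "M", "North"), (2, "F", "North"), (3, "M", "North"),
     (4, "F", "North"), (5, "M", "South")],
)

# Multiply by 100.0 so the division is done in floating point, not integers.
QUERY = """
SELECT 100.0 * SUM(CASE WHEN gender = 'M' THEN 1 ELSE 0 END) / COUNT(*)
  FROM customer
 WHERE area = ?
"""
male_pct = conn.execute(QUERY, ("North",)).fetchone()[0]
```

Dropping the `WHERE` clause and adding `GROUP BY area` would give the percentage for every area in one pass.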
Facebook Data Engineer Database - 0 of 0 votes
Get the total number of departments for each employee.
- harshvp April 12, 2017 in United States for Search
Facebook Data Engineer Database