**Genpact** interview questions for a full-time job as a **Business Analyst**

Process – Two telephonic Interviews

Note: The interviews were arranged by a referral.

## Interview Questions

- You have listed 4 programming languages – R, SQL, SAS, Python. Please rank them based on comfort and expertise.
- Can you describe any 1 project mentioned in your resume? (Follow-up questions were asked about the projects.)

- Why did you specifically use this variable? How much accuracy did you get?
- How much precision did you get?

- Please tell me a little bit about Logistic Regression.
- What are the variables that you consider in logistic regression? (I didn’t understand this question, so I explained logistic regression as a whole.)
- What is the odds ratio? How would you define a logistic function? Which factors did you use to build the model? (From Project)
- What are AUC and the ROC curve? Did you check the AUC and ROC curve after implementing the model?
- Did you use any joins in your project?
- How did you load the CSV dataset in R?
- Do you have any experience in Joins using R or SQL?
- There are 2 tables – the 1st table has 10 rows and the 2nd table has 8 rows. How many entries will I have if I do a left join vs. a right join?
- What if the table with 8 rows has 7 distinct values and 1 duplicate? When you do a left join, how many rows will we get?
- What is the way to check the number of rows in your left table?
- If I give you this exercise of joining these 2 tables, will you use `count(*)` after the left join, or is that something you usually don’t do?
- Take me through the first project you have mentioned.
- How did you find missing data in your dataset, and how did you fix that issue, for both categorical and quantitative variables?
- Can you think of any other way of imputing missing values?
- How do you check for outliers? How do you deal with outliers?
- Do you know anything about dummy variables?
- How do you interpret the coefficient of a dummy variable in linear regression?
- If a variable has 5 values in it, how many dummy variables will you create?
- Which data type do you use for categorical variables?
- What are the data types available in R?
- Explain the hypothesis testing you did in your Project.
- Is it standard to use 0.05 as a cut-off?
- Please explain Type I and Type II errors.
- Why is a Type II error more dangerous?
- Have you done any linear regression projects?
- What are the different ways of validating your linear regression model?
- Why is adjusted R² better than R²?
- What is multicollinearity in multiple linear regression?
- How do you deal with multicollinearity? What are the available methods?
- What do you know in SAS?
- You have some experience in Tableau. Tell us about that.
- Let’s say a chocolate manufacturing company wants to launch a new chocolate flavor in the market. Which flavor will you suggest?
- What are the steps you will follow? What are the questions you will ask?
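
The join questions above (a 10-row table joined to an 8-row table, with and without a duplicate) have no single answer, because the row count depends on how the keys match. The sketch below is one way to reason about it, not the answer the interviewer expected. It uses Python's built-in `sqlite3` module, and the table names `left_t` and `right_t`, the column `k`, and the key values are all made up for illustration. It assumes the keys in the 10-row table are unique and that every key in the 8-row table also appears in the 10-row table. The right join is written as a left join with the tables swapped, which returns the same rows.

```python
import sqlite3


def join_row_counts(left_keys, right_keys):
    """Return (left join row count, right join row count) for two key lists."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE left_t (k INTEGER)")
    conn.execute("CREATE TABLE right_t (k INTEGER)")
    conn.executemany("INSERT INTO left_t VALUES (?)", [(k,) for k in left_keys])
    conn.executemany("INSERT INTO right_t VALUES (?)", [(k,) for k in right_keys])

    # Left join: keeps every row of left_t, one output row per match.
    left_join = conn.execute(
        "SELECT COUNT(*) FROM left_t l LEFT JOIN right_t r ON l.k = r.k"
    ).fetchone()[0]

    # Right join, expressed as a left join with the tables swapped.
    right_join = conn.execute(
        "SELECT COUNT(*) FROM right_t r LEFT JOIN left_t l ON r.k = l.k"
    ).fetchone()[0]

    conn.close()
    return left_join, right_join


# Hypothetical keys: 10 unique keys on the left (assumption).
left_keys = list(range(1, 11))

# Case 1: 8 unique keys on the right, all present on the left (assumption).
unique_right = list(range(1, 9))

# Case 2: 8 rows on the right, 7 distinct keys with key 7 repeated once.
dup_right = [1, 2, 3, 4, 5, 6, 7, 7]

counts_unique = join_row_counts(left_keys, unique_right)
counts_dup = join_row_counts(left_keys, dup_right)

print(counts_unique)  # (10, 8)
print(counts_dup)     # (11, 8)
```

Under these assumptions, the left join returns 10 rows when the right keys are unique and 11 rows when one right key is duplicated, because the matching left row is repeated once per match. This is also why running `count(*)` before and after a join is a useful check: a row count that grows past the left table's size signals duplicate keys on the other side.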

