SAS Analyst Interview Questions

Hiring Manager Guide: Commercial PD Modeling Team

Interviewer's Preamble: "Thank you for coming in today. As you know, our team is responsible for developing, maintaining, and validating Probability of Default (PD) models for our commercial loan portfolios. SAS is our primary tool for data manipulation, analysis, and modeling. These next few questions are designed to help me understand your practical SAS skills and your approach to solving problems we face every day."

Question 1: The Foundational Merge

"Imagine you have two datasets. CUSTOMER_MASTER contains unique customer information like CustomerID, IndustryCode, and YearsInBusiness. LOAN_PERFORMANCE contains monthly performance data for every loan, with fields like LoanID, CustomerID, SnapshotDate, and DaysPastDue. How would you combine these two datasets to bring the customer's IndustryCode into the loan performance table? Please describe how you might do this using both a DATA Step MERGE and PROC SQL."

What I'm looking for:
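
For reference, a reasonable answer might resemble the sketch below. The sample rows, the output dataset names (loan_perf_merged, loan_perf_sql), and the choice of a left join that keeps every loan row are assumptions made for illustration, not requirements stated in the question.

```sas
/* Hypothetical sample data, for illustration only */
data customer_master;
  input CustomerID IndustryCode $ YearsInBusiness;
  datalines;
101 2361 12
102 4841 5
;
run;

data loan_performance;
  input LoanID CustomerID SnapshotDate :date9. DaysPastDue;
  format SnapshotDate date9.;
  datalines;
9001 101 31JAN2024 0
9001 101 29FEB2024 15
9002 102 31JAN2024 0
9003 999 31JAN2024 30
;
run;

/* Approach 1: DATA step MERGE. Both inputs must be sorted by the BY variable. */
proc sort data=customer_master;  by CustomerID; run;
proc sort data=loan_performance; by CustomerID; run;

data loan_perf_merged;
  merge loan_performance (in=inLoan)
        customer_master  (keep=CustomerID IndustryCode);
  by CustomerID;
  if inLoan;  /* keep every loan row, matched or not (left-join behavior) */
run;

/* Approach 2: PROC SQL. No pre-sorting is required. */
proc sql;
  create table loan_perf_sql as
  select l.*, c.IndustryCode
  from loan_performance as l
       left join customer_master as c
       on l.CustomerID = c.CustomerID;
quit;
```

In this sketch the loan for CustomerID 999 has no match in CUSTOMER_MASTER, so it is kept with a missing IndustryCode in both outputs. A candidate who raises the sort requirement and the one-to-many relationship without prompting is showing the right instincts.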

Question 2: Filtering Data with Nuance

"What is the practical difference between using a WHERE statement and an IF statement to filter observations within a DATA step? When would you choose one over the other?"

What I'm looking for:
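
The distinction can be illustrated with a short sketch. The dataset loans, the variable DPD_Bucket, and the bucket boundaries are hypothetical and exist only to show the mechanics: WHERE is applied as rows are read and can only reference variables that already exist in the input dataset, while a subsetting IF is evaluated inside the step and can reference variables created there.

```sas
/* Hypothetical sample data */
data loans;
  input LoanID DaysPastDue Balance;
  datalines;
1 0 1000
2 45 2500
3 95 4000
;
run;

/* WHERE: rows are filtered before they enter the program data vector. */
data delinquent_where;
  set loans;
  where DaysPastDue > 30;
run;

/* Subsetting IF: evaluated during the step, so it can use DPD_Bucket, */
/* a variable that does not exist in the input dataset.                */
data severe_if;
  set loans;
  length DPD_Bucket $5;
  if DaysPastDue >= 90 then DPD_Bucket = '90+';
  else if DaysPastDue > 30 then DPD_Bucket = '31-89';
  else DPD_Bucket = '0-30';
  if DPD_Bucket = '90+';
run;
```

Replacing the final IF in the second step with a WHERE statement would fail, because DPD_Bucket is not a variable in the input dataset.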

Question 3: Practical Feature Engineering

"Let's say you have the SnapshotDate and the OriginationDate for each loan. You need to create a new variable called LoanAgeMonths. How would you code this in a DATA step?"

What I'm looking for:
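
One plausible answer uses the INTCK function, sketched below. The sample rows and the second variable, LoanAgeMonthsFull, are assumptions added to contrast the default boundary-counting method with the 'continuous' method; the question itself only asks for LoanAgeMonths.

```sas
/* Hypothetical sample data */
data loans;
  input LoanID OriginationDate :date9. SnapshotDate :date9.;
  format OriginationDate SnapshotDate date9.;
  datalines;
9001 15JAN2023 31JAN2024
9002 31JAN2023 01FEB2023
;
run;

data loans_age;
  set loans;
  /* Guard against missing dates so the result is explicitly missing */
  if not missing(OriginationDate) and not missing(SnapshotDate) then do;
    /* Default method: counts month boundaries crossed */
    LoanAgeMonths = intck('month', OriginationDate, SnapshotDate);
    /* 'continuous' method: counts complete months elapsed */
    LoanAgeMonthsFull = intck('month', OriginationDate, SnapshotDate, 'continuous');
  end;
run;
```

For the second row, the default method returns 1 because a month boundary is crossed after a single day, while the continuous method returns 0. Which definition is appropriate is a modeling decision; the point is whether the candidate knows the difference exists.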

Question 4: Handling Missing Data

"In our commercial portfolio data, financial statement variables often have missing values. What are some SAS techniques you would use to identify and then handle these missing values before modeling?"

What I'm looking for:
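
A minimal sketch of one identify-then-handle workflow follows. The variables TotalAssets and DebtToEquity, the indicator flags, and median imputation are all assumptions chosen for illustration; the right treatment depends on why the values are missing. PROC STDIZE is part of SAS/STAT.

```sas
/* Hypothetical sample data */
data financials;
  input CustomerID TotalAssets DebtToEquity;
  datalines;
101 500000 1.2
102 . 0.8
103 750000 .
104 320000 2.5
;
run;

/* Identify: count of nonmissing and missing values per variable */
proc means data=financials n nmiss min median max;
  var TotalAssets DebtToEquity;
run;

/* Handle, step 1: keep a 0/1 flag recording where values were missing */
data financials_flagged;
  set financials;
  TotalAssets_Miss  = missing(TotalAssets);
  DebtToEquity_Miss = missing(DebtToEquity);
run;

/* Handle, step 2: REPONLY replaces missing values with the median */
/* and leaves nonmissing values unchanged (no standardization).     */
proc stdize data=financials_flagged out=financials_imputed
            method=median reponly;
  var TotalAssets DebtToEquity;
run;
```

Keeping the indicator flags lets the model test whether missingness itself carries information, which is often the case with financial statement data.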

Question 5: The Workhorse of Aggregation

"You need to create a summary table that shows the total exposure, average risk rating, and number of loans for each IndustryCode. How could you accomplish this using PROC MEANS and then using PROC SQL?"

What I'm looking for:
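
Both approaches can be sketched as below. The input dataset loans, the variable names Exposure and RiskRating, and the assumption of one row per loan are hypothetical; with monthly snapshot data the loan count would need count(distinct LoanID) or a prior deduplication.

```sas
/* Hypothetical sample data: one row per loan */
data loans;
  input LoanID IndustryCode $ Exposure RiskRating;
  datalines;
1 2361 100000 4
2 2361 250000 6
3 4841 50000 3
;
run;

/* Approach 1: PROC MEANS. NWAY keeps only the IndustryCode-level rows. */
proc means data=loans noprint nway;
  class IndustryCode;
  var Exposure RiskRating;
  output out=industry_summary_means (drop=_type_ rename=(_freq_=LoanCount))
         sum(Exposure)=TotalExposure
         mean(RiskRating)=AvgRiskRating;
run;

/* Approach 2: PROC SQL with GROUP BY */
proc sql;
  create table industry_summary_sql as
  select IndustryCode,
         sum(Exposure)    as TotalExposure,
         mean(RiskRating) as AvgRiskRating,
         count(LoanID)    as LoanCount
  from loans
  group by IndustryCode;
quit;
```

Note that _FREQ_ counts rows in each class level regardless of missing analysis values, whereas count(LoanID) counts nonmissing LoanID values, so the two counts can differ on messy data.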

Question 6: The Modeling Procedure

"For building a PD model, PROC LOGISTIC is our standard tool. Can you describe three key options or statements within this procedure that you would use, and what information you would look for in the output to assess the initial model?"

What I'm looking for:
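
As an illustration, the sketch below uses three commonly cited pieces of PROC LOGISTIC: the CLASS statement, the MODEL statement with an explicit event level, and the OUTPUT statement. The simulated dataset pd_sample, its coefficients, and the variable names DefaultFlag and PredictedPD are invented for the example and carry no empirical meaning.

```sas
/* Simulated data, purely illustrative */
data pd_sample;
  call streaminit(2024);
  length IndustryCode $4;
  do LoanID = 1 to 500;
    IndustryCode    = ifc(rand('uniform') < 0.5, '2361', '4841');
    YearsInBusiness = ceil(30 * rand('uniform'));
    DebtToEquity    = round(3 * rand('uniform'), 0.01);
    /* Made-up relationship used only to generate a default flag */
    p = 1 / (1 + exp(-(-2 + 0.8*DebtToEquity - 0.05*YearsInBusiness)));
    DefaultFlag = rand('bernoulli', p);
    output;
  end;
  drop p;
run;

proc logistic data=pd_sample;
  /* 1. CLASS: reference-cell coding for the categorical predictor */
  class IndustryCode / param=ref;
  /* 2. MODEL: model the probability of DefaultFlag = 1;           */
  /*    LACKFIT requests the Hosmer-Lemeshow goodness-of-fit test  */
  model DefaultFlag(event='1') = IndustryCode YearsInBusiness DebtToEquity
        / lackfit;
  /* 3. OUTPUT: write predicted probabilities to a dataset */
  output out=pd_scored p=PredictedPD;
run;
```

In the output, a candidate would typically check the convergence status, the signs and p-values of the parameter estimates, the c statistic in the association table, and the Hosmer-Lemeshow result. Other defensible choices, such as SELECTION= on the MODEL statement or the SCORE statement, are equally acceptable answers.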

Question 7: Automation with Macros

"Describe a situation where you would use a SAS Macro. Can you explain the difference between %LET and CALL SYMPUTX for creating macro variables?"

What I'm looking for:
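
The contrast can be shown in a few lines. In this sketch the macro name flag_default, the macro variables cutoff_dpd and latest_snapshot, and the 90-day cutoff are hypothetical. %LET assigns a value when the macro processor encounters the statement, whereas CALL SYMPUTX assigns a value while the DATA step executes, so it can be driven by the data.

```sas
/* %LET: value is fixed text, set before any DATA step runs */
%let cutoff_dpd = 90;

/* Hypothetical sample data */
data loans;
  input LoanID SnapshotDate :date9. DaysPastDue;
  format SnapshotDate date9.;
  datalines;
1 31JAN2024 0
2 29FEB2024 95
;
run;

/* CALL SYMPUTX: value is computed from the data at execution time. */
/* SYMPUTX also strips leading and trailing blanks from the value.  */
data _null_;
  set loans end=last;
  retain maxDate;
  maxDate = max(maxDate, SnapshotDate);
  if last then call symputx('latest_snapshot', put(maxDate, date9.));
run;

/* A small reusable macro parameterized by input and output dataset */
%macro flag_default(ds=, out=);
  data &out;
    set &ds;
    DefaultFlag = (DaysPastDue >= &cutoff_dpd);
  run;
%mend flag_default;

%flag_default(ds=loans, out=loans_flagged)
%put NOTE: Latest snapshot is &latest_snapshot;
```

A common follow-up is to ask why a macro variable created with CALL SYMPUTX cannot be referenced with an ampersand inside the same DATA step that creates it.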

Question 8: Performance on Large Datasets

"Our portfolio data can be very large, with millions of rows. Your code is running too slowly. What are the first three things you would investigate to improve its performance?"

What I'm looking for:
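
There is no single correct list, but three frequently mentioned starting points are reading the log's resource statistics, reducing the rows and columns read, and indexing a heavily filtered key. The sketch below illustrates each under assumed names; the generated table, the Filler column, and the date cutoff are placeholders, and whether an index helps depends on how selective the filter is.

```sas
/* 1. Ask the log for detailed CPU, memory, and I/O statistics per step */
options fullstimer;

/* Synthetic table standing in for a large portfolio dataset */
data loan_performance;
  length Filler $100;
  do LoanID = 1 to 1000;
    do m = 0 to 11;
      SnapshotDate = intnx('month', '31JAN2023'd, m, 'end');
      DaysPastDue  = 0;
      Filler       = repeat('x', 99);  /* wide column we do not need later */
      output;
    end;
  end;
  format SnapshotDate date9.;
  drop m;
run;

/* 2. Read only the columns and rows that are needed */
data recent;
  set loan_performance (keep=LoanID SnapshotDate DaysPastDue
                        where=(SnapshotDate >= '01JUL2023'd));
run;

/* 3. Create a simple index on a key that is filtered or joined often */
proc datasets library=work nolist;
  modify loan_performance;
  index create LoanID;
quit;
```

Other reasonable answers include removing redundant PROC SORT steps and checking whether the data is being pulled across a network or database connection unnecessarily.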

Question 9: Advanced Data Manipulation

"You receive a dataset where a customer's payment history is in a 'wide' format, with columns like Pay_Status_Jan, Pay_Status_Feb, etc. For modeling, you need this in a 'long' format. How would you accomplish this transformation in SAS?"

What I'm looking for:
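
Two common approaches, PROC TRANSPOSE and a DATA step array, can be sketched as follows. The sample values, the third column Pay_Status_Mar, and the output names MonthVar, Month, and PayStatus are assumptions for illustration.

```sas
/* Hypothetical wide-format data */
data pay_wide;
  input CustomerID Pay_Status_Jan $ Pay_Status_Feb $ Pay_Status_Mar $;
  datalines;
101 OK OK LATE
102 OK LATE LATE
;
run;

/* Approach 1: PROC TRANSPOSE with BY processing (input sorted by key) */
proc sort data=pay_wide; by CustomerID; run;

proc transpose data=pay_wide
               out=pay_long (rename=(_name_=MonthVar col1=PayStatus));
  by CustomerID;
  var Pay_Status_:;  /* prefix list: every variable starting with Pay_Status_ */
run;

/* Approach 2: DATA step array, writing one row per month */
data pay_long_array;
  set pay_wide;
  array ps {*} Pay_Status_:;
  length Month $3 PayStatus $8;
  do i = 1 to dim(ps);
    Month     = scan(vname(ps{i}), -1, '_');  /* last token of the name, e.g. Jan */
    PayStatus = ps{i};
    output;
  end;
  keep CustomerID Month PayStatus;
run;
```

The array approach gives more control, for example deriving a clean month value in the same pass, while PROC TRANSPOSE needs less code when the reshaping is straightforward.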

Question 10: The Full Picture Scenario

"Walk me through the high-level SAS steps you would take to prepare an analysis-ready dataset for a PD model, starting from the raw tables we discussed earlier. You don't need to write the exact code, but describe the procedures and data steps you would use in order."

What I'm looking for: