About this project

This project involved analyzing a comprehensive HR dataset encompassing various employee attributes and performance metrics. The dataset included details on employee demographics (age, gender, race, marital status, etc.), employment history (hire date, termination date, reasons for termination), compensation (salary), performance evaluations, engagement survey results, and absenteeism records. The goal was to extract meaningful insights regarding employee performance, identify factors influencing employee turnover and satisfaction, and explore potential correlations between different employee characteristics and their performance scores.

My analysis involved several key steps, including data cleaning, exploratory data analysis, and statistical modeling. I explored relationships between performance scores and salary, tenure, demographics, department, and management styles. Additionally, I investigated the factors contributing to employee attrition and employee satisfaction levels. The results were visualized using various charts and graphs to facilitate clear communication and interpretation.

Task to be done

I. Performance and Compensation
Task: Investigate the correlation between PerformanceScore and Salary.
Question: Do higher performers receive higher salaries? The impact of SpecialProjectsCount have to be cincidered.

II. Performance and Demographics
Task: Analyze if there’s a relationship between PerformanceScore and demographic factors like Sex, RaceDesc, HispanicLatino, or Age (calculated from DOB).
Be cautious about interpreting these results. Focus is on identifying the potential disparities, not making causal claims.

III. Absenteeism and Performance
Task: To examine the correlation between Absences and PerformanceScore.
Question: Do employees with higher absences tend to perform worse? Consider also the interaction with DaysLateLast30.

IV. Departmental Performance
Task: To compare the average PerformanceScore across different Departments.
Question: Are certain departments consistently outperforming others?

V. Recruitment Source and Performance
Task: To analyze if the RecruitmentSource has an impact on PerformanceScore or employee retention (Termd).

VI. Tenure and Performance
Task: To explore if employee tenure (time in the company, calculated from Date ofHire) correlates with PerformanceScore or EmpSatisfaction.

VII. Marital Status and Performance
Task: To investigate if MaritalStatusID or MaritalDesc is related to PerformanceScore or Absences.

VIII. Manager Impact
Task: To analyze if employees under certain managers (ManagerName or ManagerID) exhibit different performance patterns.

Other insights to explore

Employee Turnover
Task: To analyze the TermReason and DateofTermination to understand why employees leave the company. Segment the reasons and explore potential trends.

Diversity and Inclusion
Task: To analyze the representation of different demographic groups (RaceDesc, HispanicLatino, Sex) within the company. To calculate representation ratios for each department.

Employee Engagement and Satisfaction
Task: To explore relationships between EngagementSurvey, EmpSatisfaction, PerformanceScore, and Absences.

Salary Distribution
Task To analyze the distribution of Salary across departments, positions, and performance levels. To Check for potential pay gaps.

Time Series Analysis
Task: To use Date of Hire, LastPerformanceReview_Date, and DateofTermination for time-series analyses to spot trends in hiring, performance, and turnover.

Tools to be used

Python libraries like Pandas, NumPy, Scikit-learn, and visualization libraries (Matplotlib, Seaborn) will be helpful. Google project IDX Cloud SaaS will used for editing.

Dataset sample

Employee_NameEmpIDMarriedIDMaritalStatusIDGenderIDEmpStatusIDDeptIDPerfScoreIDFromDiversityJobFairIDSalaryTermdPositionIDPositionStateZipDOBSexMaritalDescCitizenDescHispanicLatinoRaceDescDateofHireDateofTerminationTermReasonEmploymentStatusDepartmentManagerNameManagerIDRecruitmentSourcePerformanceScoreEngagementSurveyEmpSatisfactionSpecialProjectsCountLastPerformanceReview_DateDaysLateLast30Absences
Adinolfi, Wilson K10026001154062506019Production Technician IMA19607/10/1983M SingleUS CitizenNoWhite7/5/2011N/A-StillEmployedActiveProduction Michael Albert22LinkedInExceeds4.6501/17/201901
Ait S, Karthikeyan 100841115330104437127Sr. DBAMA21485/5/1975M MarriedUS CitizenNoWhite3/30/20156/16/2016career changeVoluntarily TerminatedIT/ISSimon Roup4IndeedFully Meets4.96362/24/2016017
Akinkuolie, Sarah10196110553064955120Production Technician IIMA18109/19/1988FMarriedUS CitizenNoWhite7/5/20119/24/2012hoursVoluntarily TerminatedProduction Kissy Sullivan20LinkedInFully Meets3.02305/15/201203
Alagbe,Trina10088110153064991019Production Technician IMA18869/27/1988FMarriedUS CitizenNoWhite1/7/2008N/A-StillEmployedActiveProduction Elijiah Gray16IndeedFully Meets4.84501/3/2019015
Anderson, Carol 10069020553050825119Production Technician IMA21699/8/1989FDivorcedUS CitizenNoWhite7/11/20119/6/2016return to schoolVoluntarily TerminatedProduction Webster Butler39Google SearchFully Meets5402/1/201602

Let's
work together.

more about me

Credentials

Scroll to Top