The Framingham dataset has been successfully loaded. Here are the first few columns of the dataset:
- male: Gender of the participant (1 = Male, 0 = Female)
- age: Age of the participant
- education: Education level of the participant
- currentSmoker: Whether the participant is a current smoker (1 = Yes, 0 = No)
- cigsPerDay: Number of cigarettes smoked per day
- BPMeds: Whether the participant is on blood pressure medication (1 = Yes, 0 = No)
- prevalentStroke: Whether the participant has had a stroke (1 = Yes, 0 = No)
- prevalentHyp: Whether the participant has hypertension (1 = Yes, 0 = No)
- diabetes: Whether the participant has diabetes (1 = Yes, 0 = No)
- totChol: Total cholesterol level
- sysBP: Systolic blood pressure
- diaBP: Diastolic blood pressure
- BMI: Body Mass Index
- heartRate: Heart rate
- glucose: Glucose level
- TenYearCHD: Whether the participant developed coronary heart disease in the next 10 years (1 = Yes, 0 = No)
Next, let's analyze the correlation between these factors and the risk of developing heart disease (TenYearCHD).