Analysis

Key Takeaways

  • For our analysis, we have selected the Turkey Health Survey topic from the Turkey Statistical Institute. This survey is a research project with the aim of understanding the general state of health and the collection of data on key health indicators in order to be able to measure the development of the country. The Survey provides international comparisons and helps to identify national health needs.

  • Our data set contains a wide range of health-related information in Turkey. This information includes health habits, health care use, prevalent diseases and demographic details. As a team, we analysed health patterns among different age and gender groups, identified variations in health behaviors and disease occurrence, and understood health service utilization. Providing valuable insights for health policies and activities is our ultimate goal.

  • The data, especially when it is visualized such as below, provides us with a good understanding of the situation particularly with respect to age and sex. When we look through the various tables and plots we can see that more women seem to suffer from various health issues than men while men reportedly consume more alcohol than women. This kind of distinction not only lead us to think about sociological and cultural factors, and oppression; it also brings light to a much bigger issue in the medical field that not a lot of people consider: Women were excluded from being subjects for medical research until late 1980s. This source says that only in 1986, NIH established a policy that encouraged researchers to include women in their studies. In 1989, this inclusion encouragement was extended to minorities as well.

  • Too much alcohol can raise blood pressure and weight, increasing risk of a heart attack, stroke and type 2 diabetes, says British Heart Foundation We can actually observe this by looking at Data Set 1’s Plot 1 and Data Set 2’s implications of higher alcohol usage by men. In Data Set 1 - Plot 1 the only categories where numbers for men surpasses those of women are in fact, heart attack and stroke.

1 Dataset 1 : The Percentage of Main Diseases/Health Problems Declared by Individuals in the Last 12 Months by Sex, 2016-2022

1.1 Average Percentage by Disease with Gender Connection

Show the code
summary_data <- data_1_longer %>%
  group_by(Diseases, Gender) %>%
  summarise(mean_percentage = mean(Percentage))
abbreviate_disease_names <- abbreviate(summary_data$Diseases)
plot <- plot_ly(summary_data, 
                x = ~abbreviate_disease_names, 
                y = ~mean_percentage, 
                color = ~Gender,
                colors = c("#1E7DFF", "#FC90FF"),
                type = "bar") %>%
  layout(title = list(text="Average Percentage by Disease with Gender Connection", x = 0, y = 1),
         xaxis = list(title = "Diseases",
                      zeroline = FALSE,
                      tickangle = 45,
                      categoryorder = "trace"),
         yaxis = list(title = "Percentage",
                      zeroline = FALSE))

plot

One look at this plot and we can tell that women are more commonly reported to have health problems than men. Two categories are exceptions: Myocardial infarction, also known as a heart attack, and stroke.

1.2 Percentage by Disease with Year Connection

Show the code
abbreviate_disease_names <- abbreviate(data_1_longer$Diseases)

chart_12 <- plot_ly(data_1_longer, 
                x = ~abbreviate_disease_names, 
                y = ~Percentage, 
                color = ~as.factor(Year),
                colors = c("#49CE53", "#2B59C3","#FA7E61"),
                type = "bar") %>%
  layout(title = list(text="Percentage by Disease with Year Connection", x = 0, y = 1),
         xaxis = list(title = "Diseases",
                      tickangle = 45),
         yaxis = list(title = "Percentage"))

chart_12

This graph tells us a few key things:

  • First is the fact that 2019 was apparently a terrible year.

  • The second is that the most reported cases of health problems seem to be Low Back Disorders and Neck Problems- Which is not all that surprising when you consider the current state of world affairs, is it? People are working 9-5, hunched over desks and keyboards all day, and the situation is not very different for students, which make up about 15% of Turkey’s population according to a recent research.

  • The third one is that, interestingly enough, diabetes has seen a new high in 2022 and not 2019. There might be several reasons for that, and to make educated guesses we can put forward the lack of movement in our daily lives that got only worse after the Covid-19 pandemic. Or the fact that no one has time to consume ‘good’ food anymore. We don’t cook our own meals as often, instead opting for takeout after an exhausting shift that sucked the life out of us. Even if we do cook, we no longer can guarantee that our ingredients don’t contain a plethora of chemicals that we can’t even pronounce the names of.

1.3 Distribution of Percentage by Gender for Each Disease

Show the code
abbreviate_disease_names <- abbreviate(data_1_longer$Diseases)
box_13 <- plot_ly(data_1_longer,
              x = ~reorder(abbreviate_disease_names, -Percentage),
              y = ~Percentage,
              color = ~Gender,
              colors = c("#1E7DFF", "#FC90FF"),
              type = "box") %>% 
  layout(boxmode = "group",
         title = "Distribution of Percentage by Gender for Each Disease",
         xaxis = list(title = "Diseases",
                      zeroline = FALSE),
         yaxis = list(title = "Percentage",
                      zeroline = FALSE))
box_13

2 Data Set 2 : The Percentage of Individuals’ Status of Alcohol Use by Sex and Age Group, 2016-2022

2.1 Distribution of Alcohol Usage Percentage by Consume Status and Gender

Show the code
chart_21 <- plot_ly(data_2_longer,
              x = ~usage,
              y = ~rate,
              color = ~gender,
              colors = c("#1E7DFF", "#FC90FF"),
              type = "box") %>% 
  layout(boxmode = "group",
         title = list(text="Distribution of Alcohol Usage Percentage by Consume Status and Gender", x = 0, y = 1),
         xaxis = list(title = "Alcohol Usage",
                      zeroline = FALSE),
         yaxis = list(title = "Percentage",
                      zeroline = FALSE))
chart_21

2.2 Distribution of Alcohol Usage Percentage by Year

Show the code
chart_22 <- ggplot(data_2_longer, aes(x = year, y = rate, color = gender)) +
  geom_point(alpha = 0.7, size = 1) +
  stat_summary(fun = "mean", geom = "line", linewidth = 0.3, color = "black",
               aes(group = interaction(usage, gender)), show.legend = TRUE) +
  labs(title = "Distribution of Alcohol Usage Percentage by Year",
       x = "Year",
       y = "Percentage") +
  scale_color_manual(values = c("#1E7DFF", "#FC90FF")) +
  theme(axis.text.x = element_text(angle = 45, hjust = 1)) +
  theme_minimal() +
  facet_grid(. ~ usage, scales = "free_x", space = "free_x", switch = "both") +
  theme(panel.spacing = unit(2, "lines"))+
  scale_x_continuous(breaks = c(2016, 2019, 2022)) 

ggplotly(chart_22)

We can see that the most recent year has seen a dramatic increase in men who have never consumed alcohol, which is supported by a decrease in regular consumers and people who have consumed alcohol in the past but don’t consume anymore. The decrease in the “doesn’t consume” category seems to be more steep- Can it be the decline in the purchasing power with most recent economic disasters? Or is it the simple fact that people are choosing healthier lifestyles?

2.3 Percentage of Consumers of Alcohol Use by Sex and Age Group

Show the code
consumers_data <- data_2_longer %>%
  filter(usage == "Consumers")
consumers_data$year <- factor(consumers_data$year, levels = c(2016, 2019, 2020, 2022))
chart_23 <- ggplotly(ggplot(consumers_data, aes(x = year, y = age, fill = rate)) +
  geom_tile() +
  facet_grid(cols = vars(gender), scales = "free") +
  labs(title = "Percentage of Consumers of Alcohol Use by Sex and Age Group",
       x = "Year",
       y = "Age Group",
       fill = "Percentage") +
  scale_fill_gradient(low = "white", high = "#fc8d62") +
  theme_minimal())

chart_23

The age range for alcohol consumers seem to overlap for both men and women- We can see from this visual representation that both men and women around the ages 25-34 consume more alcohol than any other age range. Although for men, we can see the intensity of consumption seems to carry on over to the 35-44 age range as well.

3 Data Set 3 : Body Mass Index Distribution of Individuals by Sex, 2008-2022

3.2 Boxplot of Percentage by Category and Sex

Show the code
chart_33 <- plot_ly(data_3_long,
              x = ~reorder(Category, -Percentage),
              y = ~Percentage,
              color = ~Sex,
              colors = c("#1E7DFF", "#FC90FF","#8D87FF"),
              type = "box") %>% 
  layout(boxmode = "group",
         title = list(text="Boxplot of Percentage by Category and Sex", x = 0, y = 1),
         xaxis = list(title = "Category",
                      zeroline = FALSE),
         yaxis = list(title = "Percentage",
                      zeroline = FALSE))
chart_33
Back to top