Dataset Transformation with Tableau

Jobs
Job Search
Explore all available job openings across industries and locations.
Company Search
Find your dream jobs categorized by company names.
Themed Jobs
Discover job opportunities organized by specific themes or industries.
Download our App
Tools
Resume
Create your job-winning resume using our free resume builder.
Portfolio
Showcase your skills and projects with a professional portfolio.
Resume
Create your job-winning resume using our free resume builder.
Resume Builder
Make a resume for free.
Resume Templates
Access our extensive library of professional & ready-to-use templates.
Resume Examples
Get inspired by real resume examples to create your own.
Occupation Guide
Access resume writing guides tailored for different professions.
Resume Help
Get expert advice on all things resume from our team of recruitment specialists.
Portfolio
Showcase your skills and projects with a professional portfolio.
Portfolio Maker
Create a professional portfolio to highlight your skills and projects.
Portfolio Gallery
Browse through our collection of real portfolios for inspiration and networking.
Resources
Articles
Read insightful articles on career development, job search strategies, and more.
View All Articles
Job Search Guide
Resume & CV
Cover Letter
Portfolio
Interview Skills
Job Search Tips
Industry & Job Overview
Career Guidance
Career Planning
Career Tools
Career Development
Personal Branding
Success Stories
Success Stories
Business Excellence
People Operations
Recruitment & HR
About CakeResume
People & Culture
News & Updates
Events
Featured Reads
Resume & CV
What to Write in an Email When Sending a Resume [+ Examples & Tips]
Read More
Hire
Talent Search
Find Resumes.
Job Posting
Start for Free.
Recruitment Service
Acquire Talent.
Employer of Record (EOR)
Empower Your Business in Taiwan.
Employer Branding
Build and promote your employer brand.
Pricing
Job Posting Plans
Talent Search Plans
Resume Builder Plans
Build your Network
My Network
Access your personal network connections and manage your contacts.
CakeResume Meet
Expand your professional network by meeting and connecting with other users.
Community
Engage with other users through discussions, forums, and networking events.
Download our App

Build your Network

My Network

Access your personal network connections and manage your contacts.

CakeResume Meet

Expand your professional network by meeting and connecting with other users.

Community

Engage with other users through discussions, forums, and networking events.

Portfolios

Charee

Dataset Transformation with Tableau

ByCharee

International Marketing and Sales Intern

・

New Taipei City, Taiwan

Data Preparation

df <- CleandLaptopData |>

tibble()

## Due to having too many missing values in the display size, I decided to replace it with average values

mean_dps <- df |>

select(display_size) |>

filter(display_size != "Missing") |>

unlist() |>

as.numeric()

mean_dps <- as.character(round(mean(mean_dps), 2))

## To select important product specifications that can be used for analyzing

df <- df |>

select(brand, model,

processor_brand, processor_name,

ram_gb, ram_type, ssd, hdd,

os, os_bit, graphic_card_gb,

weight, display_size, Touchscreen,

latest_price, star_rating) |>

mutate(display_size = replace(

display_size, display_size == "Missing", mean_dps))

## Converting to the suitable data types

df$display_size <- as.double(df$display_size) ##double for display_size

df[, -c(13, 15, 16)] <- lapply(df[, -c(13, 15, 16)], as.factor) ##factor for others

The purpose of analyzing this time is to find the preferred components that customers gave a rating more than or equal to 4.0 stars. Instead of simply sorting from the ranking, I chose to sort from 5 brands that have the most product lists in this dataset to get enough information and bring to preferred factors accurately.

## To sort the most numbers of 5 brands in the dataset

freq_b <- table(df$brand)

sorted_b <- sort(freq_b, decreasing = TRUE)

print(names(sorted_b)[1:5])

## Results

[1] "ASUS" "DELL" "Lenovo" "HP" "acer"

# convert brands to lowercase

df$brand <- as.factor(tolower(df$brand))

## To sort by rating

df <- df[order(df$star_rating, decreasing = TRUE), ]

## To filter only top five frequent brands

top5df <- df |>

filter(brand %in% c("asus", "dell", "lenovo", "hp", "acer"))

85% of this dataset was recorded by these five brands; Asus, Dell, Lenovo, HP, and Acer accordingly.

## To find processor and ram type in products that get rating >= 4.0

pop_pcsr <- top5df |>

filter(star_rating >= 4.0) |>

select(3,4,6) |>

group_by(processor_brand, processor_name, ram_type) |>

summarise(n=n(), .groups = "drop") |>

arrange(desc(n))

Laptops that got more than or equal to 4.0 ratings from customers have processor models including Ryzen 5,7,3, and 9 from AMD. Meanwhile, processors from Intel including Core i5,i3, and i7 that are the most voted processors from filtered data.

## To analyze suitable display size data

## Average and quartile display size

fstar <- top5df |>

filter(star_rating >= 4.0)

## Average display size

avg_dis <- fstar |>

summarise(avg_dps = mean(display_size))

## Results of average display size

1 15.2

## Quantile value of display size

q_dis <- fstar |>

select(display_size)

q_dis <- quantile(q_dis, probs = c(0, .25, .5, .75, 1), na.rm = T)

## Results of all five brands

0% 25% 50% 75% 100%

13.00 15.12 15.12 15.60 17.30

For the touchscreen function, data reflects that customers do not prioritize that much. Even each laptop does not have a touchscreen for 87.90%, they still got the mentioned rating star.

## Touchscreen ratio

t_dis <- fstar |> count(Touchscreen, sort = T) |>

mutate(percent = n/sum(n)*100)

This dataset did not specify the currency of all products. For the average price is $67,144.00, meanwhile, the first to the third quartiles are $44,517.25 to $75,997.50 that still got the mentioned rating star.

## Average latest product prices

avg_p <- fstar |>

summarise(avg_p = mean(latest_price))

## Result for average latest product prices

1 67144

## Quantile value of latest product prices

q_p <- fstar |>

select(latest_price)

q_p <- quantile(q_p, probs = c(0, .25, .5, .75, 1), na.rm = T)

## Results of all five brands

0% 25% 50% 75% 100%

19990.00 44517.25 59490.00 75997.50 441990.00