Kaggle - Titanic - Machine Learning from Disaster | 潘泰宇’s Portfolio

Jobs
Job Search
Explore all available job openings across industries and locations.
Company Search
Find your dream jobs categorized by company names.
Themed Jobs
Discover job opportunities organized by specific themes or industries.
Download our App
Tools
Resume
Create your job-winning resume using our free resume builder.
Portfolio
Showcase your skills and projects with a professional portfolio.
Resume
Create your job-winning resume using our free resume builder.
Resume Builder
Make a resume for free.
Resume Templates
Access our extensive library of professional & ready-to-use templates.
Resume Examples
Get inspired by real resume examples to create your own.
Occupation Guide
Access resume writing guides tailored for different professions.
Resume Help
Get expert advice on all things resume from our team of recruitment specialists.
Portfolio
Showcase your skills and projects with a professional portfolio.
Portfolio Maker
Create a professional portfolio to highlight your skills and projects.
Portfolio Gallery
Browse through our collection of real portfolios for inspiration and networking.
Resources
Articles
Read insightful articles on career development, job search strategies, and more.
View All Articles
Job Search Guide
Resume & CV
Cover Letter
Portfolio
Interview Skills
Job Search Tips
Industry & Job Overview
Career Guidance
Career Planning
Career Tools
Career Development
Personal Branding
Success Stories
Success Stories
Business Excellence
People Operations
Recruitment & HR
About CakeResume
People & Culture
News & Updates
Events
Featured Reads
Resume & CV
What to Write in an Email When Sending a Resume [+ Examples & Tips]
Read More
Hire
Talent Search
Find Resumes.
Job Posting
Start for Free.
Recruitment Service
Acquire Talent.
Employer of Record (EOR)
Empower Your Business in Taiwan.
Employer Branding
Build and promote your employer brand.
Pricing
Job Posting Plans
Talent Search Plans
Resume Builder Plans
Build your Network
My Network
Access your personal network connections and manage your contacts.
CakeResume Meet
Expand your professional network by meeting and connecting with other users.
Community
Engage with other users through discussions, forums, and networking events.
Download our App

Build your Network

Access your personal network connections and manage your contacts.

CakeResume Meet

Expand your professional network by meeting and connecting with other users.

Engage with other users through discussions, forums, and networking events.

Kaggle - Titanic - Machine Learning f...

Kaggle - Titanic - Machine Learning from Disaster

Kaggle - Titanic - Machine Learning from Disaster

Software QA Engineer

・

Taichung City, Taiwan

Kaggle - Titanic - Machine Learning from Disaster

Titanic - Machine Learning from Disaster

這是由Kaggle所舉辦的機器學習練習賽，提供訓練資料，參賽者回傳預測結果，將顯示正確率及排名。

正確率78.229，總排名為2548/14990

在總參賽隊伍前17%。

規則如下：

使用kaggle所提供的train.csv作為訓練資料
預測test.csv資料中乘客最後是否存活
將結果輸出為乘客ID、是否存活兩項資料並回傳

訓練方式：

資料預處理
姓名特徵中包含Ms, Mr, Dr 等稱謂，將其分割出來並捨去姓名作為特徵使用
房號特徵捨去後面數字，只保留第一碼英文，遺漏值皆設為新的值'X'
年齡特徵遺漏值放入平均年齡
登船港口遺漏值設最多人登船的港口
將影響較小特徵捨去，包含乘客ID,家人數量,姓名
性別、登船港口、傳票等級、稱謂、房號做Label Encoding
訓練模型
使用GridSearchCV找出最佳超參數
使用cross_val_score測試模型準確率 # 比較模型差異，此次4種皆上傳測試
找到最佳超參數後帶入模型，即可預測test資料
測試其他模型
此次使用SVC, RandomForest, XGBoost, KNN 四種模型
重複進行2.流程訓練模型並比較準確率
預測test結果
四種模型分別對test進行預測
使用CalibratedClassifierCV將兩個以上模型的結果合併，產生新的模型並預測結果

資料預處理

姓名特徵中包含Ms, Mr, Dr 等稱謂，將其作為新特徵使用並將姓名刪除
登船港口為Ｓ人數最多
比較每個特徵與目標的相關係數，將影響較少特徵刪除

原始資料及處理後的資料如下

訓練模型方法如下，先找出最佳超參數後再用來建模型

個別使用四個模型預測test
使用CalibratedClassifierCV混合使用兩個以上模型進行預測

各個模型預測結果如下，目前屬Random Forest準確率最高。

由Kaggle所舉辦的機器學習練習賽，提供訓練資料，參賽者需回傳預測結果，將顯示正確率及排名。正確率78.229，總排名為2548/14990 在總參賽隊伍前17%。

Please login to comment.

Published: Oct 20th 2023

66

9

0

Tools

python

Python

xgboost

random forest

knn

svc

machine learning

python

Kaggle

Share

Other works from 潘泰宇

Cover of Employee-clock-in-system.

Employee-clock-in-system

Cover of Multiple Linear Regression.

Multiple Linear Regression

Cover of AI影像辨識飲食紀錄程式.

AI影像辨識飲食紀錄程式

Cover of MultipleLinearRegression_hand_engraved.

MultipleLinearRegression_hand_engraved

Cover of vocabulary-game.

vocabulary-game

Cover of Simple Linear Regression.

Simple Linear Regression