CakeResume 找人才

进阶搜寻
On
4 到 6 年
6 到 10 年
10 到 15 年
15 年以上
Avatar of 陶俊良.
Avatar of 陶俊良.
資料分析師 Data Analyst @Portto 門戶科技| Blocto
2022 ~ 2024
Data Analyst、Data Engineer、Data Scientist、Customer Experience Analyst
一個月內
陶俊良 (Tao,Chun-Liang) Taipei, Taiwan Email: [email protected] Phone:I am very sensitive to data and enjoy finding inspiration and ideas from them. I am proficient in machine learning, text analysis, and recommendation systems, EVM blockchain analytics, and currently use Python as my primary programming languages. I am always open to learning new things, such as learning new data structure from blockchain. I am currently very interested in blockchain data and on-chain user segamentation. I was working in digital media, advertising (DSP, SSP, DMP platforms), gaming user analyst, blockchain
python
R
MySQL
就职中
正在积极求职中
全职 / 对远端工作有兴趣
4 到 6 年
臺灣大學
流行病學與預防醫學所 生物統計組
Avatar of the user.
Avatar of the user.
智慧製造全端開發工程師 @聯華電子股份有限公司
2022 ~ 现在
AI工程師、機器學習工程師、深度學習工程師、影像演算法工程師、資料科學家、Ai Application Engineer,Machine Learning Engineer,Deep Learning Engineer,Data Scientist
一個月內
Python
Qt
Git
就职中
正在积极求职中
全职 / 对远端工作有兴趣
4 到 6 年
元智大學 Yuan Ze University
工業工程與管理學系所
Avatar of the user.
Avatar of the user.
曾任
博士後研究員 @洛桑大學神經發育疾病實驗室
2023 ~ 2023
Data Scientist, Data Analyst, Machine Learning Engineer
一個月內
Data Science
Data Analysis
Machine Learning
待业中
正在积极求职中
全职 / 对远端工作有兴趣
4 到 6 年
洛桑聯邦理工學院(EPFL)
神經科學
Avatar of 李慕全(MuChuan Li).
Avatar of 李慕全(MuChuan Li).
曾任
Service Provider @Taron Solutions Limited
2023 ~ 2023
AI工程師、機器學習工程師、電腦視覺工程師、資料科學家、Machine Learning Engineer、Computer Vision Engineer、Data Scientist
一個月內
李慕全(MuChuan Li) 畢業於國立臺北科技大學資工所,研究領域為深度學習、電腦視覺、及影像處理。在學期間致力於應用電腦視覺技術解決交通問題,擁有多項產學合作的專案開發經驗,亦在電腦視覺領域中發表過多篇學術論文,主要研究主題包含物
Machine Learning
Computer Vision
Pytorch/Tensorflow
待业中
正在积极求职中
全职 / 对远端工作有兴趣
4 到 6 年
國立臺北科技大學
資訊工程
Avatar of the user.
AI工程師、機器學習工程師、深度學習工程師、資料科學家、Machine Learning Engineer、Deep Learning Engineer、Data Scientist
一個月內
Python
R
Natural Language Processing (NLP)
就职中
正在积极求职中
全职 / 对远端工作有兴趣
4 到 6 年
國立政治大學(National Chengchi University)
資訊科學系
Avatar of 邱義塵.
Avatar of 邱義塵.
曾任
Data Engineer @Rooit Inc. (XO App)
2023 ~ 2023
AI工程師、機器學習工程師、深度學習工程師、資料科學家、Machine Learning Engineer、Deep Learning Engineer、Data Scientist
一個月內
邱義塵 於獨角獸多媒體設計有限公司擔任 遊戲測試工程師一職 建立公司測試團隊的測試流程和撰寫自動化測試程式 SDET、AI工程師、機器學習工程師、深度學習工程師、資料科學家、Machine Learning Engineer、Deep Learning Engineer、Data Scientist 城市,TW [email protected] 工作經歷 獨角獸多媒體
Python
Data Analysis
Data Science
待业中
正在积极求职中
全职 / 对远端工作有兴趣
6 到 10 年
中國醫藥大學(China Medical University)
臨床醫學研究所
Avatar of Chun-Jung Huang.
Avatar of Chun-Jung Huang.
OPC Chief Engineer @TSMC
2020 ~ 现在
AI工程師、機器學習工程師、深度學習工程師、資料科學家、Machine Learning Engineer、Deep Learning Engineer、Data Scientist
一個月內
Chun-Jung Huang [email protected] Chiao-Tung University, Ph.D. - Photonics,2015 ~ 2020 Member of The Phi Tau Phi Scholastic Honor Society of the Republic of China. Work Experience TSMC, OPC Chief Engineer (MarPresent) ◆Introduced image anomaly detection techniques to identify and address defects in photomask manufacturing, significantly improving product quality and reducing turnaround time. ◆Managed large-scale data processing tasks, demonstrating expertise in analyzing and handling datasets of hundreds of millions, to bolster model development and optimization. ◆Excelled in distributed computing, optimizing code execution across thousands of systems to
Deep learning with TensorFlow
Translational Research
Clinical Research
就职中
正在积极求职中
全职 / 对远端工作有兴趣
4 到 6 年
National Chiao-Tung University
Ph.D. - Clinical Engineering
Avatar of 梁賦康 (Foo-Hong, Leong).
Avatar of 梁賦康 (Foo-Hong, Leong).
Product Manager @東元電機股份有限公司 (TECO Electric & Machinery Co. Ltd.)
2023 ~ 2023
Data Scientist, Data Analyst, Machine Learning Engineer
一個月內
梁賦康 (Foo-Hong, Leong) Taoyuan City, Taiwan Email: [email protected] Tel:Skills • Languages: Python • DataBases: MySQL, SQLite • Infrastructure tools: Github • Machine learning libraries: TensorFlow, Keras, and Scikit-learn • Data visualization tools: Power BI, Seaborn and Matplotlib • Deployment: Streamlit Summary I have been working in Motor Manufacturing Industry for 8 years. My first programming was going to my Bachelor's degree, C++ was the first program I learned. Then I started to learn Python in 2018 at TEDU and my first project was the Stock Trend Prediction by CNN. I kept
Python
Power BI
Data Analytics
就职中
正在积极求职中
全职 / 对远端工作有兴趣
6 到 10 年
國立成功大學 National Cheng Kung University
Mechanical Engineering
Avatar of 陳奕妤.
Avatar of 陳奕妤.
曾任
Senior Data Analyst @趨勢科技
2022 ~ 现在
Data Scientist, Data Analyst, Machine Learning Engineer
一個月內
customers by using statistical methods and machine learning methods. Developing automation regular reports, maintaining SQL store procedures, Tableau dashboards and Power BI dashboards. Cooperated with cross-functional team (Product, Marketing, Platform, PM, IT, Sales) to provide timely and accuracy business insight analysis. Developing automated web crawler on MMA website to collect ETF, fund, bond information. Skill : Microsoft SQL Server · Microsoft Power BI · Data Cubes · R · Python · Tableau · Web Crawling · machine learning · IMPALA · HIVE · Git · Docker Data Analyst • Catchplay AprOct 2020 Indonesia OTT customer profile analysis - Collecting, analyzing and evaluating data and campaign performa...
python
R
SQL
待业中
正在积极求职中
全职 / 对远端工作有兴趣
4 到 6 年
輔仁大學 Fu Jen Catholic University
統計資訊學系
Avatar of 江易倫.
Avatar of 江易倫.
曾任
Career transition @Career Break
2024 ~ 2024
NLP Engineer / Data Scientist / Machine Learning Engineer
一個月內
江易倫 Data Scientist | Python | SQL | NLP | GenAI 具備5年以上程式撰寫能力,擅長Python、SQL與Linux 擅長資料清洗、分析與分類貼標 具有自然語言處理與研究經驗 大型語言模型LLM及生成式AI訓練與使用經驗 RAG技術使用與知識庫建立經驗 過往研究專案 中華電信智能標籤案
Python
SQL
NLP
待业中
正在积极求职中
全职 / 对远端工作有兴趣
4 到 6 年
National Chengchi University
資訊科學系

最轻量、快速的招募方案,数百家企业的选择

搜寻简历,主动联系求职者,提升招募效率。

  • 浏览所有搜寻结果
  • 每日可无限次数开启陌生对话
  • 搜尋僅開放付費企業檢視的简历
  • 检视使用者信箱 & 电话
搜寻技巧
1
Search a precise keyword combination
senior backend php
If the number of the search result is not enough, you can remove the less important keywords
2
Use quotes to search for an exact phrase
"business development"
3
Use the minus sign to eliminate results containing certain words
UI designer -UX
免费方案仅能搜寻公开简历。
升级至进阶方案,即可浏览所有搜寻结果(包含数万笔览仅在 CakeResume 平台上公开的简历)。

职场能力评价定义

专业技能
该领域中具备哪些专业能力(例如熟悉 SEO 操作,且会使用相关工具)。
问题解决能力
能洞察、分析问题,并拟定方案有效解决问题。
变通能力
遇到突发事件能冷静应对,并随时调整专案、客户、技术的相对优先序。
沟通能力
有效传达个人想法,且愿意倾听他人意见并给予反馈。
时间管理能力
了解工作项目的优先顺序,有效运用时间,准时完成工作内容。
团队合作能力
具有向心力与团队责任感,愿意倾听他人意见并主动沟通协调。
领导力
专注于团队发展,有效引领团队采取行动,达成共同目标。
半年內
Data Scientist, Data Engineer
Logo of 中國信託商業銀行股份有限公司.
中國信託商業銀行股份有限公司
2021 ~ 现在
台灣台北市
专业背景
目前状态
就职中
求职阶段
目前会考虑了解新的机会
专业
数据科学家, 机器学习工程师
产业
银行, 人工智能 / 机器学习, 广告技术 / 行销技术
工作年资
4 到 6 年
管理经历
技能
Python
R
MSSQL
Scala
Linux
PyTorch
Tensorflow (Keras)
AWS
GCP
Spark
Tensorflow
pyspark
语言能力
English
进阶
求职偏好
希望获得的职位
AI工程師、機器學習工程師、深度學習工程師、資料科學家、Machine Learning Engineer、Deep Learning Engineer、Data Scientist
预期工作模式
全职
期望的工作地点
台灣台北, 台灣新北市
远端工作意愿
对远端工作有兴趣
接案服务
是,我利用业余时间接案
学历
学校
政治大學
主修科系
統計
列印
E3uoaqcxyy6dppaet0kg

許立農 | Hsu, Li-Nung


Data Scientist、Data Engineer
Taipei
[email protected]

Education

National Chenchi University, MS, Statistics, 2015 – 2017

  • GPA : 3.84 / 4.0
  • Master Thesis: Entropy Based Feature Selection, Professor Pei-Ting, Chou
    • Objective: Build a similarity matrix based on Mutual Entropy under Hierarchical Clustering. Afterwards, select clustered features as the final selection.
    • Compare the model with other feature selection methods like RF, Lasso, F-score.

Igtt7bfqhad2uml5y0ki

National Chen-Kung University, BS, Mathematics, 2011 – 2015


Kxc0f0caus5l9rwo4qji

Skills


Programing

  • Python
  • Scala
  • R
  • MSSQL


Data-related Tools

  • Tensorflow (Keras)
  • PyTorch
  • Spark
  • Docker
  • Scikit-Learn
  • Pandas


Cloud Platform

  • AWS
  • GCP


Language

  • English: TOEFL 98 / 120

Work Experience

CTBC Bank, Model Development Department, Data Scientist

2021.12 – present

  • About the department:
    • Responsible for developing models related to bank recommendations and risks, including projects such as coupon recommendations, account opening marketing lists, and fraud detection.
  • Job responsibilities:
    • Throughout the entire project lifecycle, my primary responsibilities included model design, model training, end-to-end process development, feature design, performance tracking, and method research.
Lqnpwfiwbu3f99i6zod4

Fraud Alert Project

  • Objective:
    • Predicting potential fraudulent accounts based on transaction data, restricting transactions in advance to prevent harm.
  • Responsibilities/Achievements:
    • Development and deployment of credit card and financial features.
    • Managing the data flow process from receiving variables to model predictions, identifying risk factors, and updating alert lists.
    • Implemented Autoencoder + contrastive learning to achieve a 1.81% improvement in model effectiveness.

Coupon Recommendation

  • Objective:
    • Personalized coupon recommendations for mobile banking users to increase click-through rates and redemption rates.
  • Responsibilities/Achievements:
    • Utilized multi-task learning to simultaneously predict click-through behavior and coupon redemptions, resulting in a 14% increase in click-through rate and a 74% increase in redemption rate.
    • Created performance tracking reports to monitor online model performance and provide insights to Business Units.

Financial Product Recommendations

  • Objective:
    • Tailored financial product recommendations for mobile banking users to enhance click-through rates without compromising conversion rates.
  • Responsibilities/Achievements:
    • Applied multi-task learning to jointly learn click-through and conversion behaviors, fine-tuned model architecture, achieving a 90% outperformance against competitor models in online testing.

Marketing List for Digital Savings Accounts

  • Objective:
    • Optimized conversion rates for marketing lists related to digital savings accounts
  • Responsibilities/Achievements:
    • successfully raising conversion rates from 0.23% to 1.16%

Work Experience

CLICKFORCE, Data Engineer Supervisor, 2020.1 – 2021.11

  • About the company:
    • As a top domestic digital advertisement company, CLICKFORCE cooperates with over 900 web media and over 400 mobile media to build a huge advertising environment. CLICKFORCE considers data-driven solution as the core concept of the company, and dedicates to help advertisers to achieve their commercial goals.
    • At 2020, CLICKFORCE won 2 awards at Agency & Advertiser of the Year.
    • Successfully acquire the exclusive advertising agency qualification for Tokyo 2020 Olympics in Taiwan.
  • Job responsibilities:
    • Optimize ad performance from all aspects, including the system, target audience tags, etc.
    • Do researches for new ML model (recommender model, NLP model) or architecture which is suitable for our system.
    • Develop data-related products or projects.
    • Analyze data to help improve our system or inspect whether the demands from business side is doable.
Lqnpwfiwbu3f99i6zod4

Real-time AD Recommender System

  • Objective:
    • Building a real-time ad recommender system to upgrade our ad server and get better performance.
  • Responsibilities:
    • Figure out what kind of recommender system components that is suitable for our ad system.
    • Build a tower-like and feature-cross model refer to other famous recommender system model.
    • Responsible for system engineering, which includes data preprocessing, embedding generates, memory cache, cold start, model API, etc.

Interest Tags

  • Objective:
    • Build interest tags for ads to help ad optimizers choose their target audience.
  • Responsibilities:
    • Create the features from what articles they saw, what website they viewed, and what ads they interacted.
    • Deal with 20 million rows data and 120 million inference samples.
    • Build ML model to predict each user's behavior on certain ads.
    • Using Spark through AWS EMR to accelerate the speed of producing tags.
  • Achievements:
    • Raise CTR performance up to 200-300% of the original tags depends on different tags, and gain more impression while maintain better performance.
    • After accomplishing this project, we terminated the cost on purchasing interest tags from other company, and successfully turned the original cost into revenue by providing profitable data.

First Party Cookie Mapping

  • Objective:
    • Deal with the Google 3rd party Cookie issue, figure out a method to map numerous 1st party Cookies to a user.
  • Responsibility:
    • Transform this problem into a ML mission. Design the label of the data, figure out what feature we can get or produce and whether the feature is useful for the goal.
    • Apply XGboost on this mission.
    • Build a small test to prove this method works.
  • Achievement:
    • 70% of precision.
    • One of the solution of our company while the cancelation of 3rd party Cookie happen.

Invoice Data Application

  • Objective:
    • Develop invoice data application.
  • Responsibility:
    • Responsible for fine-tuning BERT to predict category for each product.
    • Produce invoice data report to brands or business unit. It demonstrates the sales volume across different channel, what kind of products are frequently bought together, and also shows comparison of target brand to the other brands.
  • Achievements:
    • Produce an invoice data report product.
    • Produce invoice tags for ad system.

Other Experience

E.Sun AI 2020 Summer Competition, 2020.7 – 2020.8

  • Objective:
    • Extract names of money laundering suspects from an article.
  • Responsibilities:
    • Crawl the articles from different media, and parse them by using Selenium, Requests, and Beautiful Soup.
    • Construct 2-step model: First, identify whether the article is related to money laundering. Second, extract the suspects' names.
    • Build model serving API by Tensorflow Serving.
    • Build REST API for preprocessing request data and return the prediction.
  • Achievement:
    • 23rd place among 409 teams.

Youtube Data-Driven Marketing System, Institute for Information Industry, 2019.8 – 2019.11

  • Objectives:
    • Use the title and the description of videos to automatically classify videos.
    • Use the title and the description of videos to identify whether a video is sponsored.
    • Give suggestions for Youtubers or companies who desire to sponsor in a video based on data analysis.
  •  Responsibilities:
    • Apply Google API and write Python functions to get structured raw data.
    • Train word vectors using Gensim based on Wiki's open data. 
    • Use the frequency of each sentence as a criteria to eliminate useless words.
    • Tune LSTM, Conv1D, BERT on the NLP mission.
    • Use EDA methods to see the insights of the data under different classes and different sponsored status.
  • Achievement:
    • 71% accuracy in classifying video’s type.
    • 89% accuracy in detecting sponsored content.

E.Sun Real Estate Price Prediction Competition, 2019.7 – 2019.8

  • Objective:
    • Use the real estate training data to build a model and predict the real estate price within 10% residual.
  • Responsibilities:
    • Apply XGBoost, LGBM and other ML models to train the model.
    • Collect the outputs as new features from each ML model and add them into the original data set to enhance the performance of the final model.
  • Achievement:
    • 150th place out of 1200 teams.


KKTV Data Game,2017.5 – 2017.6

  • Objective:
    • Predict the next video a user watch in the next time interval.
  • Responsibilities:
    • Extract different features from raw data, such as the latest video, the video which got the longest viewing time, the video which got the largest number of viewing.
    • Use the user viewing data to construct a similarity matrix of each video as additional features.
  • Achievement:
    • 10th place out of 50 teams.


MRT Open Data Competition, 2017.4 – 2017.5

  • Objective:
    • Study the changes of passenger volume of MRT by surrounding geometric data.
  • Responsibilities:
    • Apply bisection method to build the edges between MRT stations.
    • Combine other geometric data based on these borders.
    • Use Lasso feature selection method to explore the importance of each feature.
    • Add noises into features to check the features are not randomly selected.
  • Achievement:
    • Certificate of Honorable Mention.


简历
个人档案
E3uoaqcxyy6dppaet0kg

許立農 | Hsu, Li-Nung


Data Scientist、Data Engineer
Taipei
[email protected]

Education

National Chenchi University, MS, Statistics, 2015 – 2017

  • GPA : 3.84 / 4.0
  • Master Thesis: Entropy Based Feature Selection, Professor Pei-Ting, Chou
    • Objective: Build a similarity matrix based on Mutual Entropy under Hierarchical Clustering. Afterwards, select clustered features as the final selection.
    • Compare the model with other feature selection methods like RF, Lasso, F-score.

Igtt7bfqhad2uml5y0ki

National Chen-Kung University, BS, Mathematics, 2011 – 2015


Kxc0f0caus5l9rwo4qji

Skills


Programing

  • Python
  • Scala
  • R
  • MSSQL


Data-related Tools

  • Tensorflow (Keras)
  • PyTorch
  • Spark
  • Docker
  • Scikit-Learn
  • Pandas


Cloud Platform

  • AWS
  • GCP


Language

  • English: TOEFL 98 / 120

Work Experience

CTBC Bank, Model Development Department, Data Scientist

2021.12 – present

  • About the department:
    • Responsible for developing models related to bank recommendations and risks, including projects such as coupon recommendations, account opening marketing lists, and fraud detection.
  • Job responsibilities:
    • Throughout the entire project lifecycle, my primary responsibilities included model design, model training, end-to-end process development, feature design, performance tracking, and method research.
Lqnpwfiwbu3f99i6zod4

Fraud Alert Project

  • Objective:
    • Predicting potential fraudulent accounts based on transaction data, restricting transactions in advance to prevent harm.
  • Responsibilities/Achievements:
    • Development and deployment of credit card and financial features.
    • Managing the data flow process from receiving variables to model predictions, identifying risk factors, and updating alert lists.
    • Implemented Autoencoder + contrastive learning to achieve a 1.81% improvement in model effectiveness.

Coupon Recommendation

  • Objective:
    • Personalized coupon recommendations for mobile banking users to increase click-through rates and redemption rates.
  • Responsibilities/Achievements:
    • Utilized multi-task learning to simultaneously predict click-through behavior and coupon redemptions, resulting in a 14% increase in click-through rate and a 74% increase in redemption rate.
    • Created performance tracking reports to monitor online model performance and provide insights to Business Units.

Financial Product Recommendations

  • Objective:
    • Tailored financial product recommendations for mobile banking users to enhance click-through rates without compromising conversion rates.
  • Responsibilities/Achievements:
    • Applied multi-task learning to jointly learn click-through and conversion behaviors, fine-tuned model architecture, achieving a 90% outperformance against competitor models in online testing.

Marketing List for Digital Savings Accounts

  • Objective:
    • Optimized conversion rates for marketing lists related to digital savings accounts
  • Responsibilities/Achievements:
    • successfully raising conversion rates from 0.23% to 1.16%

Work Experience

CLICKFORCE, Data Engineer Supervisor, 2020.1 – 2021.11

  • About the company:
    • As a top domestic digital advertisement company, CLICKFORCE cooperates with over 900 web media and over 400 mobile media to build a huge advertising environment. CLICKFORCE considers data-driven solution as the core concept of the company, and dedicates to help advertisers to achieve their commercial goals.
    • At 2020, CLICKFORCE won 2 awards at Agency & Advertiser of the Year.
    • Successfully acquire the exclusive advertising agency qualification for Tokyo 2020 Olympics in Taiwan.
  • Job responsibilities:
    • Optimize ad performance from all aspects, including the system, target audience tags, etc.
    • Do researches for new ML model (recommender model, NLP model) or architecture which is suitable for our system.
    • Develop data-related products or projects.
    • Analyze data to help improve our system or inspect whether the demands from business side is doable.
Lqnpwfiwbu3f99i6zod4

Real-time AD Recommender System

  • Objective:
    • Building a real-time ad recommender system to upgrade our ad server and get better performance.
  • Responsibilities:
    • Figure out what kind of recommender system components that is suitable for our ad system.
    • Build a tower-like and feature-cross model refer to other famous recommender system model.
    • Responsible for system engineering, which includes data preprocessing, embedding generates, memory cache, cold start, model API, etc.

Interest Tags

  • Objective:
    • Build interest tags for ads to help ad optimizers choose their target audience.
  • Responsibilities:
    • Create the features from what articles they saw, what website they viewed, and what ads they interacted.
    • Deal with 20 million rows data and 120 million inference samples.
    • Build ML model to predict each user's behavior on certain ads.
    • Using Spark through AWS EMR to accelerate the speed of producing tags.
  • Achievements:
    • Raise CTR performance up to 200-300% of the original tags depends on different tags, and gain more impression while maintain better performance.
    • After accomplishing this project, we terminated the cost on purchasing interest tags from other company, and successfully turned the original cost into revenue by providing profitable data.

First Party Cookie Mapping

  • Objective:
    • Deal with the Google 3rd party Cookie issue, figure out a method to map numerous 1st party Cookies to a user.
  • Responsibility:
    • Transform this problem into a ML mission. Design the label of the data, figure out what feature we can get or produce and whether the feature is useful for the goal.
    • Apply XGboost on this mission.
    • Build a small test to prove this method works.
  • Achievement:
    • 70% of precision.
    • One of the solution of our company while the cancelation of 3rd party Cookie happen.

Invoice Data Application

  • Objective:
    • Develop invoice data application.
  • Responsibility:
    • Responsible for fine-tuning BERT to predict category for each product.
    • Produce invoice data report to brands or business unit. It demonstrates the sales volume across different channel, what kind of products are frequently bought together, and also shows comparison of target brand to the other brands.
  • Achievements:
    • Produce an invoice data report product.
    • Produce invoice tags for ad system.

Other Experience

E.Sun AI 2020 Summer Competition, 2020.7 – 2020.8

  • Objective:
    • Extract names of money laundering suspects from an article.
  • Responsibilities:
    • Crawl the articles from different media, and parse them by using Selenium, Requests, and Beautiful Soup.
    • Construct 2-step model: First, identify whether the article is related to money laundering. Second, extract the suspects' names.
    • Build model serving API by Tensorflow Serving.
    • Build REST API for preprocessing request data and return the prediction.
  • Achievement:
    • 23rd place among 409 teams.

Youtube Data-Driven Marketing System, Institute for Information Industry, 2019.8 – 2019.11

  • Objectives:
    • Use the title and the description of videos to automatically classify videos.
    • Use the title and the description of videos to identify whether a video is sponsored.
    • Give suggestions for Youtubers or companies who desire to sponsor in a video based on data analysis.
  •  Responsibilities:
    • Apply Google API and write Python functions to get structured raw data.
    • Train word vectors using Gensim based on Wiki's open data. 
    • Use the frequency of each sentence as a criteria to eliminate useless words.
    • Tune LSTM, Conv1D, BERT on the NLP mission.
    • Use EDA methods to see the insights of the data under different classes and different sponsored status.
  • Achievement:
    • 71% accuracy in classifying video’s type.
    • 89% accuracy in detecting sponsored content.

E.Sun Real Estate Price Prediction Competition, 2019.7 – 2019.8

  • Objective:
    • Use the real estate training data to build a model and predict the real estate price within 10% residual.
  • Responsibilities:
    • Apply XGBoost, LGBM and other ML models to train the model.
    • Collect the outputs as new features from each ML model and add them into the original data set to enhance the performance of the final model.
  • Achievement:
    • 150th place out of 1200 teams.


KKTV Data Game,2017.5 – 2017.6

  • Objective:
    • Predict the next video a user watch in the next time interval.
  • Responsibilities:
    • Extract different features from raw data, such as the latest video, the video which got the longest viewing time, the video which got the largest number of viewing.
    • Use the user viewing data to construct a similarity matrix of each video as additional features.
  • Achievement:
    • 10th place out of 50 teams.


MRT Open Data Competition, 2017.4 – 2017.5

  • Objective:
    • Study the changes of passenger volume of MRT by surrounding geometric data.
  • Responsibilities:
    • Apply bisection method to build the edges between MRT stations.
    • Combine other geometric data based on these borders.
    • Use Lasso feature selection method to explore the importance of each feature.
    • Add noises into features to check the features are not randomly selected.
  • Achievement:
    • Certificate of Honorable Mention.