CakeResume Talent Search

Advanced filters
On
4-6 years
6-10 years
10-15 years
More than 15 years
Avatar of the user.
Avatar of the user.
智慧製造全端開發工程師 @聯華電子股份有限公司
2022 ~ Present
AI工程師、機器學習工程師、深度學習工程師、影像演算法工程師、資料科學家、Ai Application Engineer,Machine Learning Engineer,Deep Learning Engineer,Data Scientist
Within one month
Python
Qt
Git
Employed
Ready to interview
Full-time / Interested in working remotely
4-6 years
元智大學 Yuan Ze University
工業工程與管理學系所
Avatar of 宋浩茹 Ellie Sung.
AI工程師、機器學習工程師、深度學習工程師、資料科學家、Machine Learning Engineer、Deep Learning Engineer、Data Scientist
Within one month
宋浩茹 Hao-Ru Sung| [email protected] | LinkedIn | GitHub A s a Research Assistant at Academia Sinica , specializing in Generative AI research and application. With 3 + years of experience in NLP a nd Machine Learning , along with 4+ years in Backend Development . Proficient at translating complex theories into practical applications. Skills Languages: Python, R, SQL, MATLAB, C, C#, JavaScript, Node.js Software & Tools: PyTorch, PyTorch Lightning, Tensorflow, Scikit-Learn, NLTK , GCP, Linux, SQL / NoSQ , Pandas, Hugging Face, Gradio, LangChain, Tensorflow, Keras, FastAPI, OpenCV, Airflow
Python
R
Natural Language Processing (NLP)
Employed
Ready to interview
Full-time / Interested in working remotely
4-6 years
國立政治大學(National Chengchi University)
資訊科學系
Avatar of 梁賦康 (Foo-Hong, Leong).
Avatar of 梁賦康 (Foo-Hong, Leong).
Product Manager @東元電機股份有限公司 (TECO Electric & Machinery Co. Ltd.)
2023 ~ 2023
Data Scientist, Data Analyst, Machine Learning Engineer
Within one month
梁賦康 (Foo-Hong, Leong) Taoyuan City, Taiwan Email: [email protected] Tel:Skills • Languages: Python • DataBases: MySQL, SQLite • Infrastructure tools: Github • Machine learning libraries: TensorFlow, Keras, and Scikit-learn • Data visualization tools: Power BI, Seaborn and Matplotlib • Deployment: Streamlit Summary I have been working in Motor Manufacturing Industry for 8 years. My first programming was going to my Bachelor's degree, C++ was the first program I learned. Then I started to learn Python in 2018 at TEDU and my first project was the Stock Trend Prediction by CNN. I kept
Python
Power BI
Data Analytics
Employed
Ready to interview
Full-time / Interested in working remotely
6-10 years
國立成功大學 National Cheng Kung University
Mechanical Engineering
Avatar of Ted Li.
Avatar of Ted Li.
Past
Senior Firmware Engineer @Artesyn Embedded Technologies
2019 ~ 2022
韌體工程師/軟體工程師/控制工程師/演算法工程師/
Within one month
Ted Li Senior Firmware Engineer Over 6 years of firmware/software development expertise as a Senior Firmware Engineer, specializing in embedded systems, cross- functional projects, and RL-optimizations. Driving global technical innovations and training. New Taipei City, Taiwan [email protected] https://github.com/armcortex https://www.linkedin.com/in/ted-li/ https://about.armcortex.cc/ Skill Programming C/C++ Python Bash SQL AI (PyTorch, TensorFlow, Keras) Tool RTOS Embedded System Git Docker/Docker Compose
C
Python
C/C++
Unemployed
Ready to interview
Full-time / Interested in working remotely
6-10 years
日本電氣通信大學 The University of Electro-Communications (UEC)
Robotics Engineering
Avatar of the user.
Avatar of the user.
Data Analyst & IoT Software Developer @FARAZ ERTEBAT
2021 ~ Present
Computer Vision / Deep Learning
Within one month
C#
C/C++
MySQL
Employed
Open to opportunities
Full-time / Interested in working remotely
4-6 years
University of Qom
Information Technology | Face Recognition
Avatar of Alex Yu.
Avatar of Alex Yu.
Product Manager @Linker Vision
2023 ~ Present
PM/產品經理/專案管理
Within one month
detection, segmentation, and classification AI scenario. Good communication skills with doctors' demands and collaboration with colleagues. Patent Disclosure: Ultrasound detect and notify system. (serial number: I學歷 SepJun 2 National Taiwan University of Science and Technology Masters in Electrical Engineering Thesis "Online Data Stream Analytics for Dynamic Environments Using Self-Regularized Learning Framework", IEEE journal SepJun 2020 Yuan Ze University Bachelor in Electrical Engineering Skills Customer/VC negotiation and customer services DL/ML/AI algorithm, keen problem solving 3D modeling (Blender) Python, Matlab, Tensorflow, Keras Object detection, Classification, [email protected]
Business Development
Deep Learning
PYTHON
Employed
Open to opportunities
Full-time / Interested in working remotely
4-6 years
國立台灣科技大學 National Taiwan University of Science and Technology
電機工程
Avatar of Kao Yu Chun.
Avatar of Kao Yu Chun.
Past
Full Stack Developer @PD Case Informática Ltda
2022 ~ Present
前端工程師 Front-End Developer
Within one month
高鈺鈞 我有超過5年軟體開發經驗的開發者,專注於前端開發。在我的職業生涯中,我有機會接觸到多種編程語言,如Java(包括Android開發),JavaScript,Kotlin,TypeScript和Python;同時也涉及到多種技術和框架,包括Angular,React,Redux,Spring Boot,TensorFlowKeras和OpenCV;還有Oracle,MySQL和ElasticSearch等資料庫技術。 我喜歡迎接
Java
Javascript
docker
Reputation Credits3
Unemployed
Open to opportunities
Full-time / Interested in working remotely
4-6 years
巴西聯邦大學
系統發展與分析
Avatar of the user.
Avatar of the user.
高級軟體工程師 @瀚宇彩晶股份有限公司
2022 ~ Present
Within three months
Python
Vue.js
Node.js / Express.js
Employed
Open to opportunities
Full-time / Interested in working remotely
6-10 years
國立臺中科技大學National Taichung University of Science and Technology.
資訊工程系-碩士
Avatar of 許立農.
Offline
Avatar of 許立農.
Offline
數據科學家 @中國信託商業銀行股份有限公司
2021 ~ Present
AI工程師、機器學習工程師、深度學習工程師、資料科學家、Machine Learning Engineer、Deep Learning Engineer、Data Scientist
Within three months
許立農 | Hsu, Li-Nung Data Scientist、Data Engineer Taipei [email protected] Education National Chenchi University, MS, Statistics, 2015 – 2017 GPA : 3.84 / 4.0 Master Thesis: Entropy Based Feature Selection, Professor Pei-Ting, Chou Objective: Build a similarity matrix based on Mutual Entropy under Hierarchical Clustering. Afterwards, select clustered features as the final selection. Compare the model with other feature selection methods like RF, Lasso, F-score. National Chen-Kung University, BS, Mathematics, 2011 – 2015 Skills Programing Python Scala R MSSQL Data-related Tools Tensorflow (Keras) PyTorch Spark Docker
Python
R
MSSQL
Employed
Open to opportunities
Full-time / Interested in working remotely
4-6 years
政治大學
統計
Avatar of 王豪逸.
Avatar of 王豪逸.
Software Developer @ZeroLogix
2022 ~ Present
前端工程師 Front-End Developer
Within one month
王豪逸 中山大學資訊管理學系 研究所 [email protected] https://github.com/strongball 技能 程式語言 JavaScript TypeScript Python C Java C# 開發工具 Git Docker Linux(ubuntu) Jest 前端開發 Vue, Nuxtjs React , Nextjs Graphql Restful WebSocket CSS-in-JS, styled-components Electron 後端開發 Graphql ORM(Typeorm, Prisma) MySQL, PostgreSQL 資料分析 / 機器學習 Pytorch TensorFlow Keras Scikit-learn 工作經歷 科林儀器
Python
React.js
Next.js
Employed
Open to opportunities
Full-time / Interested in working remotely
4-6 years
國立中山大學-資管系
資訊管理

The Most Lightweight and Effective Recruiting Plan

Search resumes and take the initiative to contact job applicants for higher recruiting efficiency. The Choice of Hundreds of Companies.

  • Browse all search results
  • Unlimited access to start new conversations
  • Resumes accessible for only paid companies
  • View users’ email address & phone numbers
Search Tips
1
Search a precise keyword combination
senior backend php
If the number of the search result is not enough, you can remove the less important keywords
2
Use quotes to search for an exact phrase
"business development"
3
Use the minus sign to eliminate results containing certain words
UI designer -UX
Only public resumes are available with the free plan.
Upgrade to an advanced plan to view all search results including tens of thousands of resumes exclusive on CakeResume.

Definition of Reputation Credits

Technical Skills
Specialized knowledge and expertise within the profession (e.g. familiar with SEO and use of related tools).
Problem-Solving
Ability to identify, analyze, and prepare solutions to problems.
Adaptability
Ability to navigate unexpected situations; and keep up with shifting priorities, projects, clients, and technology.
Communication
Ability to convey information effectively and is willing to give and receive feedback.
Time Management
Ability to prioritize tasks based on importance; and have them completed within the assigned timeline.
Teamwork
Ability to work cooperatively, communicate effectively, and anticipate each other's demands, resulting in coordinated collective action.
Leadership
Ability to coach, guide, and inspire a team to achieve a shared goal or outcome effectively.
Within six months
Data Scientist, Data Engineer
Logo of 中國信託商業銀行股份有限公司.
中國信託商業銀行股份有限公司
2021 ~ Present
台灣台北市
Professional Background
Current status
Employed
Job Search Progress
Open to opportunities
Professions
Data Scientist, Machine Learning Engineer
Fields of Employment
Banking, Artificial Intelligence / Machine Learning, AdTech / MarTech
Work experience
4-6 years
Management
None
Skills
Python
R
MSSQL
Scala
Linux
PyTorch
Tensorflow (Keras)
AWS
GCP
Spark
Tensorflow
pyspark
Languages
English
Fluent
Job search preferences
Positions
AI工程師、機器學習工程師、深度學習工程師、資料科學家、Machine Learning Engineer、Deep Learning Engineer、Data Scientist
Job types
Full-time
Locations
台灣台北, 台灣新北市
Remote
Interested in working remotely
Freelance
Yes, I freelance in my spare time
Educations
School
政治大學
Major
統計
Print
E3uoaqcxyy6dppaet0kg

許立農 | Hsu, Li-Nung


Data Scientist、Data Engineer
Taipei
[email protected]

Education

National Chenchi University, MS, Statistics, 2015 – 2017

  • GPA : 3.84 / 4.0
  • Master Thesis: Entropy Based Feature Selection, Professor Pei-Ting, Chou
    • Objective: Build a similarity matrix based on Mutual Entropy under Hierarchical Clustering. Afterwards, select clustered features as the final selection.
    • Compare the model with other feature selection methods like RF, Lasso, F-score.

Igtt7bfqhad2uml5y0ki

National Chen-Kung University, BS, Mathematics, 2011 – 2015


Kxc0f0caus5l9rwo4qji

Skills


Programing

  • Python
  • Scala
  • R
  • MSSQL


Data-related Tools

  • Tensorflow (Keras)
  • PyTorch
  • Spark
  • Docker
  • Scikit-Learn
  • Pandas


Cloud Platform

  • AWS
  • GCP


Language

  • English: TOEFL 98 / 120

Work Experience

CTBC Bank, Model Development Department, Data Scientist

2021.12 – present

  • About the department:
    • Responsible for developing models related to bank recommendations and risks, including projects such as coupon recommendations, account opening marketing lists, and fraud detection.
  • Job responsibilities:
    • Throughout the entire project lifecycle, my primary responsibilities included model design, model training, end-to-end process development, feature design, performance tracking, and method research.
Lqnpwfiwbu3f99i6zod4

Fraud Alert Project

  • Objective:
    • Predicting potential fraudulent accounts based on transaction data, restricting transactions in advance to prevent harm.
  • Responsibilities/Achievements:
    • Development and deployment of credit card and financial features.
    • Managing the data flow process from receiving variables to model predictions, identifying risk factors, and updating alert lists.
    • Implemented Autoencoder + contrastive learning to achieve a 1.81% improvement in model effectiveness.

Coupon Recommendation

  • Objective:
    • Personalized coupon recommendations for mobile banking users to increase click-through rates and redemption rates.
  • Responsibilities/Achievements:
    • Utilized multi-task learning to simultaneously predict click-through behavior and coupon redemptions, resulting in a 14% increase in click-through rate and a 74% increase in redemption rate.
    • Created performance tracking reports to monitor online model performance and provide insights to Business Units.

Financial Product Recommendations

  • Objective:
    • Tailored financial product recommendations for mobile banking users to enhance click-through rates without compromising conversion rates.
  • Responsibilities/Achievements:
    • Applied multi-task learning to jointly learn click-through and conversion behaviors, fine-tuned model architecture, achieving a 90% outperformance against competitor models in online testing.

Marketing List for Digital Savings Accounts

  • Objective:
    • Optimized conversion rates for marketing lists related to digital savings accounts
  • Responsibilities/Achievements:
    • successfully raising conversion rates from 0.23% to 1.16%

Work Experience

CLICKFORCE, Data Engineer Supervisor, 2020.1 – 2021.11

  • About the company:
    • As a top domestic digital advertisement company, CLICKFORCE cooperates with over 900 web media and over 400 mobile media to build a huge advertising environment. CLICKFORCE considers data-driven solution as the core concept of the company, and dedicates to help advertisers to achieve their commercial goals.
    • At 2020, CLICKFORCE won 2 awards at Agency & Advertiser of the Year.
    • Successfully acquire the exclusive advertising agency qualification for Tokyo 2020 Olympics in Taiwan.
  • Job responsibilities:
    • Optimize ad performance from all aspects, including the system, target audience tags, etc.
    • Do researches for new ML model (recommender model, NLP model) or architecture which is suitable for our system.
    • Develop data-related products or projects.
    • Analyze data to help improve our system or inspect whether the demands from business side is doable.
Lqnpwfiwbu3f99i6zod4

Real-time AD Recommender System

  • Objective:
    • Building a real-time ad recommender system to upgrade our ad server and get better performance.
  • Responsibilities:
    • Figure out what kind of recommender system components that is suitable for our ad system.
    • Build a tower-like and feature-cross model refer to other famous recommender system model.
    • Responsible for system engineering, which includes data preprocessing, embedding generates, memory cache, cold start, model API, etc.

Interest Tags

  • Objective:
    • Build interest tags for ads to help ad optimizers choose their target audience.
  • Responsibilities:
    • Create the features from what articles they saw, what website they viewed, and what ads they interacted.
    • Deal with 20 million rows data and 120 million inference samples.
    • Build ML model to predict each user's behavior on certain ads.
    • Using Spark through AWS EMR to accelerate the speed of producing tags.
  • Achievements:
    • Raise CTR performance up to 200-300% of the original tags depends on different tags, and gain more impression while maintain better performance.
    • After accomplishing this project, we terminated the cost on purchasing interest tags from other company, and successfully turned the original cost into revenue by providing profitable data.

First Party Cookie Mapping

  • Objective:
    • Deal with the Google 3rd party Cookie issue, figure out a method to map numerous 1st party Cookies to a user.
  • Responsibility:
    • Transform this problem into a ML mission. Design the label of the data, figure out what feature we can get or produce and whether the feature is useful for the goal.
    • Apply XGboost on this mission.
    • Build a small test to prove this method works.
  • Achievement:
    • 70% of precision.
    • One of the solution of our company while the cancelation of 3rd party Cookie happen.

Invoice Data Application

  • Objective:
    • Develop invoice data application.
  • Responsibility:
    • Responsible for fine-tuning BERT to predict category for each product.
    • Produce invoice data report to brands or business unit. It demonstrates the sales volume across different channel, what kind of products are frequently bought together, and also shows comparison of target brand to the other brands.
  • Achievements:
    • Produce an invoice data report product.
    • Produce invoice tags for ad system.

Other Experience

E.Sun AI 2020 Summer Competition, 2020.7 – 2020.8

  • Objective:
    • Extract names of money laundering suspects from an article.
  • Responsibilities:
    • Crawl the articles from different media, and parse them by using Selenium, Requests, and Beautiful Soup.
    • Construct 2-step model: First, identify whether the article is related to money laundering. Second, extract the suspects' names.
    • Build model serving API by Tensorflow Serving.
    • Build REST API for preprocessing request data and return the prediction.
  • Achievement:
    • 23rd place among 409 teams.

Youtube Data-Driven Marketing System, Institute for Information Industry, 2019.8 – 2019.11

  • Objectives:
    • Use the title and the description of videos to automatically classify videos.
    • Use the title and the description of videos to identify whether a video is sponsored.
    • Give suggestions for Youtubers or companies who desire to sponsor in a video based on data analysis.
  •  Responsibilities:
    • Apply Google API and write Python functions to get structured raw data.
    • Train word vectors using Gensim based on Wiki's open data. 
    • Use the frequency of each sentence as a criteria to eliminate useless words.
    • Tune LSTM, Conv1D, BERT on the NLP mission.
    • Use EDA methods to see the insights of the data under different classes and different sponsored status.
  • Achievement:
    • 71% accuracy in classifying video’s type.
    • 89% accuracy in detecting sponsored content.

E.Sun Real Estate Price Prediction Competition, 2019.7 – 2019.8

  • Objective:
    • Use the real estate training data to build a model and predict the real estate price within 10% residual.
  • Responsibilities:
    • Apply XGBoost, LGBM and other ML models to train the model.
    • Collect the outputs as new features from each ML model and add them into the original data set to enhance the performance of the final model.
  • Achievement:
    • 150th place out of 1200 teams.


KKTV Data Game,2017.5 – 2017.6

  • Objective:
    • Predict the next video a user watch in the next time interval.
  • Responsibilities:
    • Extract different features from raw data, such as the latest video, the video which got the longest viewing time, the video which got the largest number of viewing.
    • Use the user viewing data to construct a similarity matrix of each video as additional features.
  • Achievement:
    • 10th place out of 50 teams.


MRT Open Data Competition, 2017.4 – 2017.5

  • Objective:
    • Study the changes of passenger volume of MRT by surrounding geometric data.
  • Responsibilities:
    • Apply bisection method to build the edges between MRT stations.
    • Combine other geometric data based on these borders.
    • Use Lasso feature selection method to explore the importance of each feature.
    • Add noises into features to check the features are not randomly selected.
  • Achievement:
    • Certificate of Honorable Mention.


Resume
Profile
E3uoaqcxyy6dppaet0kg

許立農 | Hsu, Li-Nung


Data Scientist、Data Engineer
Taipei
[email protected]

Education

National Chenchi University, MS, Statistics, 2015 – 2017

  • GPA : 3.84 / 4.0
  • Master Thesis: Entropy Based Feature Selection, Professor Pei-Ting, Chou
    • Objective: Build a similarity matrix based on Mutual Entropy under Hierarchical Clustering. Afterwards, select clustered features as the final selection.
    • Compare the model with other feature selection methods like RF, Lasso, F-score.

Igtt7bfqhad2uml5y0ki

National Chen-Kung University, BS, Mathematics, 2011 – 2015


Kxc0f0caus5l9rwo4qji

Skills


Programing

  • Python
  • Scala
  • R
  • MSSQL


Data-related Tools

  • Tensorflow (Keras)
  • PyTorch
  • Spark
  • Docker
  • Scikit-Learn
  • Pandas


Cloud Platform

  • AWS
  • GCP


Language

  • English: TOEFL 98 / 120

Work Experience

CTBC Bank, Model Development Department, Data Scientist

2021.12 – present

  • About the department:
    • Responsible for developing models related to bank recommendations and risks, including projects such as coupon recommendations, account opening marketing lists, and fraud detection.
  • Job responsibilities:
    • Throughout the entire project lifecycle, my primary responsibilities included model design, model training, end-to-end process development, feature design, performance tracking, and method research.
Lqnpwfiwbu3f99i6zod4

Fraud Alert Project

  • Objective:
    • Predicting potential fraudulent accounts based on transaction data, restricting transactions in advance to prevent harm.
  • Responsibilities/Achievements:
    • Development and deployment of credit card and financial features.
    • Managing the data flow process from receiving variables to model predictions, identifying risk factors, and updating alert lists.
    • Implemented Autoencoder + contrastive learning to achieve a 1.81% improvement in model effectiveness.

Coupon Recommendation

  • Objective:
    • Personalized coupon recommendations for mobile banking users to increase click-through rates and redemption rates.
  • Responsibilities/Achievements:
    • Utilized multi-task learning to simultaneously predict click-through behavior and coupon redemptions, resulting in a 14% increase in click-through rate and a 74% increase in redemption rate.
    • Created performance tracking reports to monitor online model performance and provide insights to Business Units.

Financial Product Recommendations

  • Objective:
    • Tailored financial product recommendations for mobile banking users to enhance click-through rates without compromising conversion rates.
  • Responsibilities/Achievements:
    • Applied multi-task learning to jointly learn click-through and conversion behaviors, fine-tuned model architecture, achieving a 90% outperformance against competitor models in online testing.

Marketing List for Digital Savings Accounts

  • Objective:
    • Optimized conversion rates for marketing lists related to digital savings accounts
  • Responsibilities/Achievements:
    • successfully raising conversion rates from 0.23% to 1.16%

Work Experience

CLICKFORCE, Data Engineer Supervisor, 2020.1 – 2021.11

  • About the company:
    • As a top domestic digital advertisement company, CLICKFORCE cooperates with over 900 web media and over 400 mobile media to build a huge advertising environment. CLICKFORCE considers data-driven solution as the core concept of the company, and dedicates to help advertisers to achieve their commercial goals.
    • At 2020, CLICKFORCE won 2 awards at Agency & Advertiser of the Year.
    • Successfully acquire the exclusive advertising agency qualification for Tokyo 2020 Olympics in Taiwan.
  • Job responsibilities:
    • Optimize ad performance from all aspects, including the system, target audience tags, etc.
    • Do researches for new ML model (recommender model, NLP model) or architecture which is suitable for our system.
    • Develop data-related products or projects.
    • Analyze data to help improve our system or inspect whether the demands from business side is doable.
Lqnpwfiwbu3f99i6zod4

Real-time AD Recommender System

  • Objective:
    • Building a real-time ad recommender system to upgrade our ad server and get better performance.
  • Responsibilities:
    • Figure out what kind of recommender system components that is suitable for our ad system.
    • Build a tower-like and feature-cross model refer to other famous recommender system model.
    • Responsible for system engineering, which includes data preprocessing, embedding generates, memory cache, cold start, model API, etc.

Interest Tags

  • Objective:
    • Build interest tags for ads to help ad optimizers choose their target audience.
  • Responsibilities:
    • Create the features from what articles they saw, what website they viewed, and what ads they interacted.
    • Deal with 20 million rows data and 120 million inference samples.
    • Build ML model to predict each user's behavior on certain ads.
    • Using Spark through AWS EMR to accelerate the speed of producing tags.
  • Achievements:
    • Raise CTR performance up to 200-300% of the original tags depends on different tags, and gain more impression while maintain better performance.
    • After accomplishing this project, we terminated the cost on purchasing interest tags from other company, and successfully turned the original cost into revenue by providing profitable data.

First Party Cookie Mapping

  • Objective:
    • Deal with the Google 3rd party Cookie issue, figure out a method to map numerous 1st party Cookies to a user.
  • Responsibility:
    • Transform this problem into a ML mission. Design the label of the data, figure out what feature we can get or produce and whether the feature is useful for the goal.
    • Apply XGboost on this mission.
    • Build a small test to prove this method works.
  • Achievement:
    • 70% of precision.
    • One of the solution of our company while the cancelation of 3rd party Cookie happen.

Invoice Data Application

  • Objective:
    • Develop invoice data application.
  • Responsibility:
    • Responsible for fine-tuning BERT to predict category for each product.
    • Produce invoice data report to brands or business unit. It demonstrates the sales volume across different channel, what kind of products are frequently bought together, and also shows comparison of target brand to the other brands.
  • Achievements:
    • Produce an invoice data report product.
    • Produce invoice tags for ad system.

Other Experience

E.Sun AI 2020 Summer Competition, 2020.7 – 2020.8

  • Objective:
    • Extract names of money laundering suspects from an article.
  • Responsibilities:
    • Crawl the articles from different media, and parse them by using Selenium, Requests, and Beautiful Soup.
    • Construct 2-step model: First, identify whether the article is related to money laundering. Second, extract the suspects' names.
    • Build model serving API by Tensorflow Serving.
    • Build REST API for preprocessing request data and return the prediction.
  • Achievement:
    • 23rd place among 409 teams.

Youtube Data-Driven Marketing System, Institute for Information Industry, 2019.8 – 2019.11

  • Objectives:
    • Use the title and the description of videos to automatically classify videos.
    • Use the title and the description of videos to identify whether a video is sponsored.
    • Give suggestions for Youtubers or companies who desire to sponsor in a video based on data analysis.
  •  Responsibilities:
    • Apply Google API and write Python functions to get structured raw data.
    • Train word vectors using Gensim based on Wiki's open data. 
    • Use the frequency of each sentence as a criteria to eliminate useless words.
    • Tune LSTM, Conv1D, BERT on the NLP mission.
    • Use EDA methods to see the insights of the data under different classes and different sponsored status.
  • Achievement:
    • 71% accuracy in classifying video’s type.
    • 89% accuracy in detecting sponsored content.

E.Sun Real Estate Price Prediction Competition, 2019.7 – 2019.8

  • Objective:
    • Use the real estate training data to build a model and predict the real estate price within 10% residual.
  • Responsibilities:
    • Apply XGBoost, LGBM and other ML models to train the model.
    • Collect the outputs as new features from each ML model and add them into the original data set to enhance the performance of the final model.
  • Achievement:
    • 150th place out of 1200 teams.


KKTV Data Game,2017.5 – 2017.6

  • Objective:
    • Predict the next video a user watch in the next time interval.
  • Responsibilities:
    • Extract different features from raw data, such as the latest video, the video which got the longest viewing time, the video which got the largest number of viewing.
    • Use the user viewing data to construct a similarity matrix of each video as additional features.
  • Achievement:
    • 10th place out of 50 teams.


MRT Open Data Competition, 2017.4 – 2017.5

  • Objective:
    • Study the changes of passenger volume of MRT by surrounding geometric data.
  • Responsibilities:
    • Apply bisection method to build the edges between MRT stations.
    • Combine other geometric data based on these borders.
    • Use Lasso feature selection method to explore the importance of each feature.
    • Add noises into features to check the features are not randomly selected.
  • Achievement:
    • Certificate of Honorable Mention.