Profile 02 00@2x 71843ef6a0df47d6255a9c0436c409dcd5cd81f6514c51a6b2a93339d82bbff6

linsam

data engineer、data scientist  •  台灣  •  [email protected]

Experience with data mining, machine learning, and web crawling. Hopes to focus more on data science and data engineer in future career.

Skills


Data Mining

Python - numpy, pandas, sklearn. 

R - parallel, dplyr, data.table, mice.


Machine Learning

Python - xgboost-gpu. 

R - xgboost, svm, random forest, knn.


Deep Learning

Python - kears-CNN.


Statistical Model

R - GLM, GLMNET, NLS, SUR, MLE.


Web Crawling

Python - request, BeautifulSoup, selenium.


Others

Execting deployment MySQL on ubuntu. 

Changing IP address to entity address by No-IP and installing SSL certificates by Let’s Encrypt.

Projects


Bosch Production Line Performance - Kaggle 

Post-competition analysis, top 6% rank.


Highly imbalance data, ratio is 1000 : 1, 10 GB dataset size. And the data is 50% missing value. More than 4000 variables, but I build machine learning models by only 50 features.



Grupo Bimbo Inventory Demand - Kaggle 

Post-competition analysis, top 8% rank.


Time series problem, eighty millions data size. Building models to predict inventory demand after 2 weeks.


Rossmann Store Sales - Kaggle 




Post-competition analysis, top 10% rank.


Time series problem. Building models to predict sales after 48 days.


Instacart Market Basket Analysis - Kaggle

Real competition, top 25% rank.


Predicting which products will an consumer purchase again.


Open Source of PTT Data

99 stars on github.




FB-ChatBot

Automatic ordering Taiwan train tickets, and recognizing Taiwan train verification codes by CNN models.

Work Experience


NDHU - RA

Mar. 2016 - Aug. 2017 

Analysing G7 financial data. Model validation and parameter estimation by regression models ( SUR, MLE, Bootstrapping ). 

And comparing single equation estimators and confidence interval with system equation.


NDHU - TA

Sep. 2015 - Jul. 2017

Calculus, Linear Algebra, Statistics

Languages


R, Python. Basic in English and proficient in Chinese.

Powered by CakeResumePowered by CakeResume
Powered by CakeResumePowered by CakeResume