Yen-Ting Liu 我具有5年python資料分析,熟悉以Docker搭配nginx, redis部屬api及系統於GCP上。熟悉Airflow程式及報表自動化分析流程,並有Hadoop,Elasticsearch群集管理實務、pyspark數據ETL經驗。我喜歡學習新技術,並追求以更高效率進行資料處理流程。 Santa Clara, CA, USA [email protected] 工作經歷 Data Engineer
陳昭儒(Chao-Ju Chen) Github [email protected] Education National Taiwan University Bachelor’s Degree, Electrical Engineering 2012 ~ 2017 Project Highlights Aggregating Files in one ETL, output 60B row to Data Warehouse Input :gzipped files(200GB in total) Task : Loading columns with values parsed from each gzipped file name. Wrote to BigQuery existing table(specific schema) in parallel. Tool: GCP Dataflow(Hosted Serverless Apache Beam) Result : The job took 40min to finish. Machine Type: n1-standard-1(1 vcpu, 3.75GB memory) Autoscaled up to 122 workers at peak. The data
known their requirement and current difficulty, and guide end-user to establish their own analysis flow, thus reducing and replacing many daily manual analysis processes. In the meantime, i have experience on In-house user training too. iv. ETL for Tableau. I write python script on pyspark to summary daily output, machine error code, quality checking data, and pass it to Tableau for visualization. v. Unscheduled AI and statistical education and training for production line person and engineers. 學歷 SepJun 2012 逢甲大學 Applied Mathematics - Master degree 技能 Data
Data Augmentation for Rare Defect Images
Signal Processing & Recognition
Administrator for Engineering Data Analysis System
Ignazio Panades With 5 years in the data industry, covering database administration, data modeling, BI reporting and data science, I seek a non-consulting role as a Data Scientist or Data Analyst in a company that aligns with my values . Eager to assume greater responsibilities, work autonomously, and make a significant impact. Bergamo, Italy [email protected] Experience Consultant (DBA & Data Scientist) • Avanade Italy MarchPresent Predictive maintenance (Data Analysis and Machine Learning, PySpark) Database Administration (Data Analysis and Data Modeling, SSMS/SSIS) Database Administrator • sorint.tek DecemberFebruary 2019 Database Administration (Cloudera Hadoop
SQL
Python
PySpark
Full-time / Tertarik bekerja jarak jauh
4-6 tahun
Bergamo University
・
Master Degree in Management , Finance and International Business
Paket Perekrutan Paling Mudah dan Efektif, Pilihan Ratusan Perusahaan
Cari lebih dari 800 ribu CV dan ambil aksi menghubungi pelamar kerja untuk rekrutmen yang lebih efektif. Pilihan ratusan perusahaan.