Avatar of tangwei.
tangwei
NLP Engineer / Data Analyst
ProfileResume
Posts
15Connections
Print
Avatar of the user.

tangwei

NLP Engineer / Data Analyst
Graduated from Monash University with the master's degree in Data Science in 2021, and familiar with Python, Database, and Machine Learning skills. 2+ years of experience in Data Science in NLP, Graph Database (Neo4j), and MLops. Having Google Cloud Certified professional machine learning engineer and Neo4j Graph Data Science Certification.
Logo of the organization.
TPISoftware
Logo of the organization.
Monash University
Taipei, 台灣

Professional Background

  • Current status
    Employed
  • Profession
    Data Scientist
    Data Engineer
    Machine Learning Engineer
  • Fields
    Big Data
    Artificial Intelligence / Machine Learning
    Information Services
  • Work experience
    2-4 years (2-4 years relevant)
  • Management
    None
  • Skills
    Python
    SQL
    R
    Spark
    Tableau
    Machine Learning
    ETL
    MLOps
    Neo4j
    PostreSQL
    MySQL
    Database
    Docker
    tensorflow
    PyTorch
    NLP
    Google Cloud Platform (GCP)
    Airflow
  • Languages
    English
    Professional
  • Highest level of education
    Master

Job search preferences

  • Desired job type
    Full-time
    Interested in working remotely
  • Desired positions
    Data Analyst 數據分析師 / Data Scientist 資料科學家
  • Desired work locations
  • Freelance

Work Experience

Logo of the organization.

Data Analyst

TPISoftware
Full-time
Mar 2021 - Present
Taipei City, Taiwan
MLops: • Cooperate with Google Inc to Build MLops on GCP for Customers. • Build ETL pipeline by using airflow and dataflow on GCP. • Use Kubeflow Python SDK to build training pipeline with AutoML and Custom model. • CI/CD with Cloud Repository and Cloud Build on GCP. • Continuous train pipeline build on GCP. ETL: • Batch Export Cloud SQL table to Cloud Storage by using Cloud Kubernetes Engine • Build ETL job by using Cloud Workflow on GCP NLP: • Build sentence classification model by using BERT, tf-idf, word2vec, Sentence-BERT in the government agency project. • Build Docker Image for model serving. • Use Django framework for RESTful API. Graph Data Science: • Use Graph Algorithms in Neo4j to detect smuggling crime in RFP for the government agency. • Use Neo4j to build knowledge graphs. • As an employee training speaker about teaching Neo4j Graph Database for Cathay. • Applying Data Science in Finance: Neo4j with GDS and ML for Financial Fraud Detection
Logo of the organization.

Data Assistant

Dec 2019 - Feb 2020
3 mos
Taipei City, Taiwan
• Interact with PostgreSQL • Data Wrangling • Word Tokenization • TF-IDF • Word Embedding (Word2Vec) • Topic Modelling (LDA) • Data Labeling • Assist to set up internal label system (Django)
Logo of the organization.

Quality Assurance Intern

ULSee
Internship
Dec 2018 - Mar 2019
4 mos
• Quality Assurance Test • Update QA Table • Unity/Blender
Logo of the organization.

Foreign Remittance Staff

Oct 2017 - Oct 2018
1 yr 1 mo
• Provide foreign remittance to the client. • Help the client exchange money to foreign currency. • Process big foreign remittance data, and analyze data • Answer client all questions on foreign remittance. • Communicate and coordinate with other departments.
Logo of the organization.

Teaching Assistant

Feb 2015 - Jul 2015
6 mos
• Taught students to use Excel to build a stock portfolio and use big data to complete targets. • Helped the teacher teach Asset pricing. • Prepared teaching equipment for the teacher. • Offered advice for students and help them find solutions.

Education

Logo of the organization.
Master’s Degree
Data Science
2019 - 2021
Description
Databases: • Introduction to databases (OracleSQL) Data Science (Basic): • Introduction to data science (Python, R) • Data Wrangling (Python) • Modelling for data analysis (Python) Data Science (Advance): • Data processing for big data (Python, Spark, MongoDB) • Data analysis for semi-structured data(Python- PyTorch) • Machine Learning(Python, R) • Applied Data Analysis(Python, R) • Data Exploration and visualisation (Python, Tableau, R, D3)

Licenses & Certifications

Logo of the organization.
Google
Credential ID: 722905
Issued Feb 2022
No Expiration Date
Logo of the organization.

Neo4j Certified Professional

Neo4j
Credential ID: 17156968
Issued May 2021
No Expiration Date
Logo of the organization.

Graph Data Science Certified

Neo4j
Credential ID: 17150531
Issued Apr 2021
No Expiration Date