I am a passionate software engineer, fascinated by information technology and its versatile software ecosystem. I have worked in the software field, primarily in data engineering, for more than 9 years. In the near term, I plan to keep learning about service architecture at scale, advanced data engineering, machine learning-enhanced software models, and cloud architecture. My long-term goal is to become a software architect or data architect and revolutionize the software model as a whole.
My experience includes, but is not limited to:
#Java #Scala #JavaScript #Python #Shell #Matlab #PHP #Spark #Hadoop #Glue #Athena #Kafka #Solr #Redis #MySQL #MSSQL #AWS #GCP #Git #Bootstrap #HTML/CSS #SpringBoot #PlayFramework #CodeIgniter #Tomcat #Jersey #Jetty
2023/03 - Present
Data Platform - Alchemy Team
1. Serve as the key person in creating an automated testing framework for cross-team Spark ETL modules.
2. Use GitLab CI to streamline upstream and downstream projects across different teams.
3. Design and create the test plan based on Gitflow.
4. Create the activity log reporter in the pipeline using SNS.
5. Remove the Glue crawler to reduce overhead and provide schema evolution solutions.
Skill Set: Scala, Java, Python, Spark, Glue, Athena, AWS Systems Manager, IntelliJ, ScalaTest, Airflow
2019/02 - Present
Health Data Cloud - Data Model Infrastructure team
1. Build the continuous integration flow on GitLab to increase the velocity of the development cycle.
2. Use Java and TestNG to design and develop a cross-team (four-team) CI framework that improves the E2E testing process.
3. Use Scala and ScalaTest to design and develop an automated testing framework for the ingestion stage.
4. Build a fake clinical data generator using Python and Faker.
5. Design test plans and test cases, and coordinate the team's entire testing process.
6. Build a centralized, standardized testing flow that gives QA engineers a clear guide.
7. Coordinate with Dev and QA across three countries (US, India, Taiwan).
8. Self-authored documentation:
https://jasonchang-ccs.gitbook.io/software-testing/
https://jasonchang-ccs.gitbook.io/hello-python-tutorial/
Skill Set: Scala, Java, Python, TestNG, SBT, Spark, ScalaTest, GitLab CI, AWS, Postman
2016/03 - 2017/06
Company Founder, Chief Software Engineer
Take charge of all software and infrastructure-related work. Use PHP and CodeIgniter to build a hospital foundation website: http://www.landseedhealthcare.org.tw/
Use SpringBoot, Bootstrap, jQuery, MySQL, and GCP to build the company website and products.
Skill Set: PHP, CodeIgniter, AWS, Bootstrap, JavaScript, HTML/CSS, jQuery, MySQL, Java, SpringBoot
2014/09 - 2016/02
Big Data Team
1. Use Java multithreading libraries to create an ETL system that loads data from multiple sources (MSSQL, HBase, Redis, Kafka) into Solr.
2. Create a multi-threaded ETL framework using Java Phaser.
3. Maintain and monitor the Apache Solr system.
4. Maintain and monitor NoSQL-related systems (HBase, Hadoop, Redis).
5. Build an MSSQL fake data generator in Java.
6. Use Play Framework, jQuery, and Bootstrap to create an HBase monitoring web app.
Skill Set: Java, JavaScript, Play Framework, Jersey, Solr, Kafka, Hadoop, HBase, Redis, Bootstrap, AngularJS, MSSQL, Maven, Tomcat, Shell, Linux
2012/07 - 2014/07
In addition to the two IEEE papers mentioned above, I also have experience in:
1. Using JSoup to create a PTT crawler, and designing and building an MVP of a web app service
2. Mining a large set of Wi-Fi data and using Gephi to create a social graph
3. Analyzing an AWS shopping dataset to compute several SNA coefficients and identify relationships between buyers
4. Building a dog nose recognition system (paper published in 2011) using Hadoop MapReduce
5. Implementing the TF-IDF algorithm on MapReduce
6. Implementing several AI algorithms, including genetic algorithms, ant algorithms, and artificial neural networks (ANN)
7. Using an SVM library to implement dimensionality reduction
2004/07 - 2008/07
Issued October 2010 · No expiration date