I bring 5 years of hands-on experience in data engineering and software development, with a focus on building scalable data processing systems utilizing Hadoop, Spark, Kafka and Docker. My expertise in developing efficient ETL pipelines has been fundamental in optimizing data workflows for various data warehouses, enhancing data integrity and availability.
My track record includes managing high-volume data pipelines, automating scheduling processes to improve operational efficiency, and deploying monitoring solutions that have reduced Mean-Time-To-Repair (MTTR) by 40%. I have a strong foundation in SQL, especially PostgreSQL, which enables me to handle complex data analysis tasks effectively. My technical skill set is rounded out with proficiency in Python, Scala, Airflow for workflow management, Docker for containerization, and Linux shell scripting.
I have led projects that improved client procurement efficiency by 15% and increased deployment rates by 60%, demonstrating my ability to leverage data insights to drive business improvements.
With a Master's in Information Management and a Bachelor's in Economics, I possess a deep understanding of the data lifecycle, from mining and visualization to machine learning and statistical analysis, using tools like PowerBI, Seaborn, and Tableau.
My specialized skills in Time Series Analysis and Machine Learning are complemented by practical experience in data workflow management platforms, such as Airflow. I am proficient in English, which has been invaluable in my work within multidisciplinary teams, ensuring clear and effective communication.
Seeking a role as a Data Engineer or Data Analyst, I am eager to apply my technical expertise and analytical skills to contribute to meaningful projects and collaborate with a dynamic team.
July 2021 - Present
- Built and maintained data piplines (through which several hundred millions rows of data flow through daily) using Scala Spark/ Hadoop
- Managed cron jobs and performed regular data recovery using Apache Airflow
- Performed regular Extract, transform, load (ETL) operations through Hive and HDFS command line interfaces
- Utilized streaming technologies such as Kafka to store data into various pools and lakes alike
September 2020 - December 2020
Designed and developed student assessments/assignments for professors in
the School of Engineering, focusing on Data Management, Data Cleaning and
Scripting and Data Visualization using Python 3 and SQLite
April 2017 - July 2019
Java under Spring framework with JavaScript (AngularJS and jQuery), plus SQL Server, MySQL, and Amazon Web Services
Accelerated client's procurement process by 15 %
Utilized Amazon Web Services AWS EC2 for deployment and version control
Reduced bug occurrences by 60% in production
Raised bug awareness by 50% in dev/test
Obtained Professional Scrum Master I certification
September 2010 - June 2015
Lorem ipsum dolor sit amet, consectetuer adipiscing elit, sed diam nonummy nibh euismod tincidunt ut laoreet dolore magna aliquam erat volutpat.
July 2012 - January 2015
Lorem ipsum dolor sit amet, consectetuer adipiscing elit, sed diam nonummy nibh euismod tincidunt ut laoreet dolore magna aliquam erat volutpat.
September 2012 - September 2014
Lorem ipsum dolor sit amet, consectetuer adipiscing elit, sed diam nonummy nibh euismod tincidunt ut laoreet dolore magna aliquam erat volutpat.
2019 - 2021
2014 - 2015
2010 - 2014
2007 - 2010
I bring 5 years of hands-on experience in data engineering and software development, with a focus on building scalable data processing systems utilizing Hadoop, Spark, Kafka and Docker. My expertise in developing efficient ETL pipelines has been fundamental in optimizing data workflows for various data warehouses, enhancing data integrity and availability.
My track record includes managing high-volume data pipelines, automating scheduling processes to improve operational efficiency, and deploying monitoring solutions that have reduced Mean-Time-To-Repair (MTTR) by 40%. I have a strong foundation in SQL, especially PostgreSQL, which enables me to handle complex data analysis tasks effectively. My technical skill set is rounded out with proficiency in Python, Scala, Airflow for workflow management, Docker for containerization, and Linux shell scripting.
I have led projects that improved client procurement efficiency by 15% and increased deployment rates by 60%, demonstrating my ability to leverage data insights to drive business improvements.
With a Master's in Information Management and a Bachelor's in Economics, I possess a deep understanding of the data lifecycle, from mining and visualization to machine learning and statistical analysis, using tools like PowerBI, Seaborn, and Tableau.
My specialized skills in Time Series Analysis and Machine Learning are complemented by practical experience in data workflow management platforms, such as Airflow. I am proficient in English, which has been invaluable in my work within multidisciplinary teams, ensuring clear and effective communication.
Seeking a role as a Data Engineer or Data Analyst, I am eager to apply my technical expertise and analytical skills to contribute to meaningful projects and collaborate with a dynamic team.
July 2021 - Present
- Built and maintained data piplines (through which several hundred millions rows of data flow through daily) using Scala Spark/ Hadoop
- Managed cron jobs and performed regular data recovery using Apache Airflow
- Performed regular Extract, transform, load (ETL) operations through Hive and HDFS command line interfaces
- Utilized streaming technologies such as Kafka to store data into various pools and lakes alike
September 2020 - December 2020
Designed and developed student assessments/assignments for professors in
the School of Engineering, focusing on Data Management, Data Cleaning and
Scripting and Data Visualization using Python 3 and SQLite
April 2017 - July 2019
Java under Spring framework with JavaScript (AngularJS and jQuery), plus SQL Server, MySQL, and Amazon Web Services
Accelerated client's procurement process by 15 %
Utilized Amazon Web Services AWS EC2 for deployment and version control
Reduced bug occurrences by 60% in production
Raised bug awareness by 50% in dev/test
Obtained Professional Scrum Master I certification
September 2010 - June 2015
Lorem ipsum dolor sit amet, consectetuer adipiscing elit, sed diam nonummy nibh euismod tincidunt ut laoreet dolore magna aliquam erat volutpat.
July 2012 - January 2015
Lorem ipsum dolor sit amet, consectetuer adipiscing elit, sed diam nonummy nibh euismod tincidunt ut laoreet dolore magna aliquam erat volutpat.
September 2012 - September 2014
Lorem ipsum dolor sit amet, consectetuer adipiscing elit, sed diam nonummy nibh euismod tincidunt ut laoreet dolore magna aliquam erat volutpat.
2019 - 2021
2014 - 2015
2010 - 2014
2007 - 2010