1. Develop and operate the data warehouse system and ETL pipelines for data access, collection, processing, and storage, and support data analysis tasks
2. Manage deployment of the platform on public clouds with hundreds of instances across the globe
3. Work on the Big Data and Machine Learning Platform using Apache Spark and related technologies
4. Lay the foundation for the platform and propose solutions that ease software development, software monitoring, and related tasks
5. Handle hundreds of terabytes of data