Nov 2019 - Present
• Developed an API server that automatically schedules and extracts product and sales feeds from MSPs.
• Designed and built the API server with Flask, Celery, DynamoDB, and Docker containers, along with AWS ECR, Secrets Manager, and Aurora.
• Integrated platform APIs (e.g., Walmart, eBay, Shopee, Lazada, Shopify) for extracting product information and sales reports.
• Wrote unit and integration tests.
• Developed a scalable web crawling system and data pipeline for collecting product information from e-commerce websites.
• Integrated Selenium, Splash, and Python Requests into the crawling system.
• Designed and built the web crawling system with Scrapy-Splash and Docker containers on AWS EC2, along with ECR, ALB, Route 53, Lambda, Serverless, Ansible, and Traefik.
• Developed a downloader that automatically downloads files using Playwright and Selenium through a proxy server.