Description
Senior Staff - Systems Engineer
The ICT Reliability Engineering team is dedicated to maintaining the continuity and stability of Coupang’s enterprise IT services. The team operates and continuously improves monitoring systems for both IT infrastructure and applications, ensuring high visibility and rapid incident detection. In the event of service disruptions, the team collaborates closely with engineering and operations teams to resolve issues efficiently and manage key performance metrics. Additionally, the team leads regular disaster recovery (DR) tests to validate system resilience and ensure business continuity.
Key Responsibilities:
- Automate deployment, configuration, and scaling of monitoring tools using:
- Terraform, Ansible, Puppet, or similar IaC tools
- REST APIs for platforms like Zabbix, SolarWinds, Prometheus, and Grafana
- Develop reusable automation scripts and templates to standardize monitoring across environments
- Integrate monitoring solutions with alerting, ticketing, and reporting systems
- Implement tagging strategies and observability standards to ensure consistent data collection
- Support incident response by providing automated diagnostics and data enrichment
- Collaborate with DevOps and SRE teams to align monitoring automation with CI/CD pipelines
Tech Skills:
- Infrastructure as Code (IaC):
- Proficiency in tools like Terraform, Ansible, Puppet, or Jsonnet
- Scripting & Automation:
- Strong skills in Python, Bash, or PowerShell for automation tasks
- Monitoring Tools:
- Experience with monitoring platforms (Monitoring Servers, Network Devices, Network Links)
- API Integration:
- Experience working with REST APIs to automate monitoring configurations and data extraction
- Cloud Platforms:
- Good understanding of AWS services and monitoring cloud-native infrastructure
- CI/CD Integration:
- Familiarity with integrating monitoring automation into CI/CD pipelines
- Strong grasp of OSI model, TCP/IP, and network/system internals
- Experience with Linux/Unix systems and basic Windows server environments
Preferred Qualifications:
- Experience with containerized environments (Docker, Kubernetes)
- Relevant certifications (e.g., AWS Certified DevOps Engineer, HashiCorp Terraform Associate, CCIE)
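Equal Opportunities for All
Coupang provides equal opportunities to all employees. Without the valuable input of our diverse global team, we could not have achieved our unprecedented success.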