Position: Technical Operation Engineer
Responsibilities
Build the infrastructure for the UAT and production environments
- Work with team members to build and test the UAT and production environments
- Remediate security vulnerabilities identified in the environments to meet compliance requirements
- Use advanced diagnostic tools and techniques to identify underlying issues
Daily technical operation of the UAT and production environments
- Investigate and diagnose complex incidents and issues, and perform root cause analysis for recurring or critical incidents
- Coordinate with L1 support to ensure proper incident handover and resolution
- Collaborate with the security team on security event and incident response
Documentation and knowledge management
- Update and maintain documentation related to troubleshooting procedures and incident resolutions
- Contribute to the knowledge base by documenting new issues and solutions
Role requirements
- Bachelor’s degree in Computer Science, Information Technology, or a related field (or equivalent work experience)
- Proven experience as an IT Operations and Maintenance Engineer or in a similar role, with a focus on troubleshooting and incident management
- Strong technical expertise in IT systems, networks, storage, and virtualization technologies
- Experience with cloud administration (tenant space) for both infrastructure and applications, including VM, container, and image management and application deployment
- Experience with monitoring tools and IT management software (e.g., Grafana, Nagios, SolarWinds, Microsoft System Center)
- Solid understanding of ITIL practices, including incident, problem, change, and release management
- Excellent analytical and problem-solving skills, with strong attention to detail
- Ability to work effectively under pressure and prioritize tasks in a dynamic environment
- Professional spoken English, and the ability to read and write emails and documents in English
Knowledge of most of the following technologies is required:
- Linux OS: CentOS, Red Hat
- Cloudflare: Anti-DDoS, WAF, CDN, DNS
- Azure: Web Application Gateway, Firewall, AE server (DNS), SLB, VPC, security group, ECS, MySQL (HA), Key Vault, Blob, Redis, Bastion
- Nginx, ELK, Kafka, Nacos, RabbitMQ, XXL-JOB cluster, ES, Canal, ZooKeeper, VPN
- Application: Web services (API, management system), backend services, operation frontend service, OP1 (CI/CD, ELK, middleware management, monitoring system)