We are seeking a highly skilled Cloud Support Engineer with strong hands-on experience in AWS and Azure production environments. This role focuses on supporting customer workloads, provisioning cloud infrastructure, resolving incidents, and ensuring optimal performance of mission-critical systems. The ideal candidate brings a strong background in cloud operations and excels in direct client support scenarios.
-
Design, deploy, and manage cloud infrastructure using AWS best practices.
-
Monitor, troubleshoot, and optimize performance in production environments.
-
Ensure system security, data protection, and compliance with organizational and regulatory standards.
-
Support high availability and disaster recovery planning and implementation.
-
Collaborate with developers and other engineers to support application deployment and scalability.
-
Respond to production incidents and participate in root cause analysis and remediation.
-
Provision, configure, and manage cloud resources including EC2/VMs, storage, security groups, load balancers, VPC networking, IAM policies.
-
Support hybrid environments and assist customers with migrations between on-prem and cloud.
-
Monitor infrastructure performance and availability using CloudWatch, Azure Monitor, and other alerting tools.
-
Respond to support tickets for outages, degraded performance, networking issues, and account configuration requests.
-
Implement cloud security controls, patching oversight, backup and recovery operations.
-
Troubleshoot connectivity, DNS, VPN, firewall/security rule issues across AWS and Azure.
-
Assist with tagging, reporting, and resource governance to ensure compliance and cost optimization.
-
Participate in on-call rotations and incident root-cause reviews.
-
Partner with internal teams and the customer on improving operational reliability.
-
AWS Certification required (SysOps Administrator preferred; Solutions Architect or similar acceptable).
-
35+ years in AWS cloud operations or support engineering roles (Azure experience strongly preferred as secondary skill).
-
Strong understanding of:
-
VPC networking, transit gateways, routing, subnets, security groups, NACLs
-
EC2 sizing, EBS, snapshots, load balancers, Availability Zones, IAM
-
Hands-on experience supporting production systems and responding to incidents.
-
Experience with ticketing workflows and service delivery (ITIL familiarity is a plus).
-
Scripting for automation (PowerShell, Bash, or Python) light usage; not a developer role.
-
Familiarity with Azure equivalent services (VMs, VNets, NSGs, AAD, monitoring, Storage Accounts).
-
Should be willing to accept a long-term work-from-home arrangement.
-
Should be amenable to a permanent night shift schedule.