Job Title
ClickHouse Cluster Engineer, Distributed Analytics Platforms
Location
Islamabad, Hybrid or Remote
Full time
About the Role
We are looking for a hands-on ClickHouse Cluster Engineer with 3 or more years of relevant experience to own the reliability, performance, and scalability of our distributed analytics infrastructure. This role goes beyond routine maintenance. You will design cluster architecture, solve performance challenges at scale, and build automation that keeps systems resilient as data volumes and usage grow.
If you thrive on deep technical ownership, production problem solving, and building systems that remain stable under pressure, this role is for you.
Core Responsibilities
Cluster Management and Architecture
- Design, install, configure, and operate distributed ClickHouse clusters in production
- Implement replication, sharding, and high availability using ZooKeeper
- Continuously evolve cluster topology to support product growth and performance demands
Performance Optimization
- Monitor and troubleshoot performance bottlenecks across large analytical workloads
- Optimize queries, schemas, and storage engines, including advanced MergeTree configurations
- Tune ClickHouse parameters to achieve predictable, high-throughput performance
Availability and Disaster Recovery
- Design and maintain backup, restore, and disaster recovery strategies
- Ensure data integrity, fast recovery, and minimal downtime during failure scenarios
Monitoring and Alerting
- Build and maintain proactive monitoring and alerting using Prometheus and Grafana
- Define meaningful metrics and alerts that enable early detection and prevention of incidents
Automation and Infrastructure as Code
- Automate provisioning, maintenance, and operational workflows
- Use scripting languages such as Shell or Python and tools like Terraform and Ansible to eliminate manual processes
Security and Access Control
- Manage user access, roles, and permissions
- Ensure compliance with data security, privacy, and audit requirements
Collaboration and Support
- Partner with engineering, data, and operations teams
- Provide guidance on ClickHouse best practices and support application integrations and deployments
Required Skills and Experience
- Minimum 3 years of hands-on experience managing ClickHouse or comparable distributed analytical databases in production
- Strong understanding of distributed systems, replication, sharding, and high availability concepts
- Proven experience in performance tuning for large-scale analytical workloads
- Practical knowledge of Linux, scripting, and infrastructure automation
- Experience with monitoring and observability tools such as Prometheus and Grafana
Nice to Have
- Experience operating ClickHouse at high data volumes or in multi-cluster environments
- Exposure to cloud or hybrid infrastructure
- Strong ownership mindset with the ability to work independently on complex systems
Why Join Us
- Work on real, large-scale data systems with direct business impact
- High autonomy and meaningful technical ownership
- Opportunity to influence architecture, standards, and best practices from the ground up
Job Type: Full-time
Pay: From Rs150,000.00 per month
Work Location: In person