Overview
We are seeking a hands-on StarRocks Engineer to join our Developer Observability team. You will architect, operate, and optimize large-scale StarRocks deployments powering real-time analytics and observability dashboards. This role requires deep expertise in StarRocks internals, performance tuning, operational excellence, and integration with modern data pipelines.
Responsibilities
* Design, build, and maintain production-grade StarRocks clusters for high-volume, low-latency analytics workloads.
* Lead schema/data modeling for clickstream, dimensional, and time-series data, ensuring deduplication, upserts, and efficient partitioning.
* Optimize query performance using materialized views, indexes, and advanced StarRocks features (vectorization, pipeline execution).
* Diagnose and resolve high-severity incidents, leveraging logs, metrics, and profiling tools; implement permanent fixes and guardrails.
* Plan and execute capacity planning, scaling, and resource management for multi-tenant environments.
* Develop and enforce backup, disaster recovery, and upgrade strategies with minimal downtime.
* Implement security best practices: AuthN/AuthZ, encryption, auditing, and data masking.
* Collaborate with engineering and SRE teams to define SLOs, observability dashboards, and alerting for StarRocks infrastructure.
Qualifications
* Proven experience architecting and operating StarRocks (or similar OLAP) clusters at scale (10B+ events/day, TB–PB data).
* Deep understanding of StarRocks FE/BE architecture, storage engines, compaction, and cost-based optimizer.
* Expertise in schema design, partitioning, bucketing, and index/encoding strategies for hot/cold data tiers.
* Strong troubleshooting skills: root cause analysis, performance profiling, and incident response.
* Experience with ingestion pipelines (Kafka/Kinesis), exactly-once semantics, and CDC-based upserts.
* Familiarity with materialized view design, refresh policies, and query rewrite mechanics.
* Operational excellence: backup/restore, rolling upgrades, resource governance, and multi-tenant management.
* Security and compliance: LDAP/OIDC integration, RBAC, encryption, and sensitive data handling.
* Excellent communication skills; ability to diagram architectures and explain trade-offs to technical and non-technical stakeholders.
Nice to Have
* Experience with other OLAP systems (ClickHouse, Druid, Pinot) and comparative benchmarking.
* Contributions to StarRocks open source or related communities.
* Cloud-native deployment experience (Kubernetes, autoscaling, infrastructure-as-code).