About SUBCO
With a proven track record of delivering transformative connectivity, SUBCO owns and operates critical submarine cable assets linking Australia to Asia, the Middle East, and beyond. Building on this foundation, a new 5,000 km domestic cable system - connecting emerging digital hubs across Australia - is set to go live in 2026, marking a major milestone in SUBCO's evolution.
As demand for high-capacity, resilient infrastructure accelerates, SUBCO is entering a strategic growth phase of major scale up, actively pursuing global opportunities to deliver innovative connectivity solutions that meet the needs of cloud, content, and AI-driven enterprises.
Our Values
At SUBCO, our values are the foundation of everything we do. They shape our culture, guide our decisions, and define how we interact with our customers, partners, and each other. From being
Customer Obsessed
to always
Raising the Bar
, we are committed to innovation and excellence. Together, we are a team of Trailblazers who
Grow Together
and hold ourselves accountable, ensuring that
We Each Set the Standard
for success. These values aren't just words – they drive us forward every day.
Role Purpose
As a Network Automation and Production Engineer at SUBCO, you will sit at the intersection of operations and engineering. You'll ensure the availability and performance of our global optical and IP networks while building the automation, observability, and tooling that make the network more resilient, scalable, and self-healing.
This role balances incident response and on-call network operations with proactive development of dashboards, alerting, orchestration, and automation tools that reduce manual effort, improve visibility, and accelerate recovery across our critical infrastructure.
Ideally this role is located in our head office in Brisbane, but we are flexible on location.
Key Responsibilities
Observability & Monitoring
* Develop and maintain monitoring and telemetry platforms using Grafana, Prometheus, Alertmanager, and related tooling.
* Build dashboards, alerts, and automated responses that provide real-time insight into fibre, optical transport, IP routing/switching, and environmental systems.
Automation Engineering
* Design, code, and maintain automation solutions for configuration management, deployment, validation, and troubleshooting.
* Create scripts, APIs, and workflows that eliminate repetitive tasks and reduce operational overhead.
* Contribute to CI/CD pipelines for network and infrastructure services.
Incident Response & On-Call
* Act as part of the 24x7 on-call rotation, triaging and resolving incidents to restore service quickly and safely
* Manage incident communications to internal stakeholders and external customers, ensuring timely and transparent updates.
* Perform deep-dive troubleshooting across the full stack of network systems to support service restoration.
Operations & Change
* Ensure robust engineering of end-to-end solution for customer services covering Fibre patches, network equipment and operational systems.
* Execute network changes and maintenance via peer-reviewed Methods of Procedure (MOPs), ensuring safety and repeatability.
* Maintain clear and accurate records in Jira Service Management for incidents, changes, and problem tickets.
Reliability & Continuous Improvement
* Identify systemic issues through trend analysis, and evolve monitoring rules, automation playbooks, and escalation workflows.
* Participate in post-incident reviews (PIRs), helping feed insights into better automation, resilience, and reliability
* Drive a culture of continuous improvement by embedding automation and observability into all operational practices. Uplift network operations staff from tactical incident response towards strategic production engineering practices.
Key skills and experience required to be successful in the role
* Experience working in a network operations environment ideally within a tech company
* Experience with backbone engineering : Optical Networks and/or Enterprise grade routing/switching
* Proficiency in scripting and automation using Python, Ansible, or similar tools.
* Must have experience in software or system development and tooling including knowledge of open standard cloud/open-source platforms incl. gNMI, Prometheus, Grafana, MongoDB.
* Proven ability to apply automation and continuous improvement practices to enhance day-to-day operations.
* Skilled in network performance troubleshooting and event triage within complex global network environments
* Hands-on experience with service management systems (Jira Service Management or similar) across incident, change, and problem management workflows
* Experience with monitoring, alerting, and observability platforms such as Observium, MCP by Ciena, and New Relic
* Clear and calm communicator with the ability to effectively manage customer expectations and internal escalations under pressure
* Self-motivated and hands-on, with a demonstrated ability to engage directly in technical implementation and operational improvement
* Strong documentation habits, including incident summaries, PIRs, and shift handover reports
* Self-motivated team player, adaptable to shift-based work and remote collaboration with global colleagues
Skills and experience that would set you apart from the rest
* Familiarity with network protocols (BGP, OSPF, VLANs, VPNs, etc.).
* Experience operating in a 24x7 NOC or network support function across network backbone domains
* Comfortable working across vendor platforms including Cisco, Ciena, Nokia (routers, switches, optical)
* Familiarity with Data Centre environments including rack management, power, and thermal considerations
* Understanding of security compliance and physical access control processes within telecommunications facilities
* Working knowledge of ITIL v3 or v4 principles including Service Operations and Service Transition
* Competent in executing and reviewing Method of Procedures (MOPs) for planned work and changes
* Strong knowledge of transmission, long-haul, subsea, and optical networks (DWDM, MPLS, BGP, Ethernet)