We’re ambitious, curious, and gutsy doers. We keep hierarchy low across the company and morale high in our teams. We’ve already achieved a lot, yet we’re only getting started. Now it’s your chance to join the ride. We offer more than just a job – we offer a career-defining opportunity to be part of building something big!
Join Verda while it’s still being built – not once it’s finished.
Why Verda
Practicalities
About The Role
Verda’s customers run AI workloads that cannot afford to go down. Behind every SLA we sign is a data center that has to deliver it around the clock, every day of the year. We are looking for a Data Center Operations & Reliability Manager to own that promise.
You will be accountable for the operational reliability of our data center sites: committing to and following up on our SLAs, tracking and mitigating equipment downtime, running the 24/7 shift coverage of our support engineers, enforcing safety and security guidelines, and owning the incident reporting loop from first alert to closed follow-up.
What You Will Do
Own SLA commitments and performance. Define, monitor, and report on service levels, and drive corrective action when targets are at risk.
Track equipment downtime across sites, analyze failure patterns, and lead mitigation: root cause analysis, preventive measures, and escalation with vendors where needed.
Plan and manage 24/7 shift schedules for support engineers, ensuring continuous coverage, fair rotation, and adequate staffing for planned maintenance and peak periods.
Enforce and continuously improve safety and security guidelines, ensuring that all on-site work follows established protocols and compliance requirements.
Oversee incident reports end-to-end: ensure incidents are documented, communicated, followed up, and closed with root cause and prevention actions.
Report regularly to management on reliability metrics, incident trends, and operational risks.
What We Are Looking For
5+ years of experience in data center operations, critical facilities, or mission-critical infrastructure environments.
Proven experience managing or scheduling teams in a 24/7 shift-based operation.
Hands-on understanding of data center infrastructure: power, cooling, networking, and their common failure modes.
Experience with SLA management and operational reporting in a customer-facing infrastructure business.
Strong incident management skills: structured response, root cause analysis, and disciplined follow-up.
Familiarity with safety and security protocols in critical environments.
Strong written and verbal English.
Strong Plus
Experience in GPU, HPC, or hyperscale cloud environments, including high-density racks and liquid cooling.
Experience with monitoring, ticketing, and maintenance management systems (e.g., DCIM, CMMS).
Data center certifications such as CDCP or equivalent.
Experience building reliability processes from scratch in a fast-growing company.
What’s Next
We’re building fast and this role needs the right person behind it. There’s no artificial deadline, but when we find who we’re looking for, we move. If this sounds like your next move, apply now.
Please submit your application through our Careers page. We don’t accept applications sent by email.
