Data Centre Operations Engineer

Data Centre Operations Engineer
Your New Company

My client’s nature of business is within the Ai Infrastructure sector, providing PaaS / SaaS platforms.

Your New Role

  • Bachelor’s degree in Computer Science, Information Technology, Electrical Engineering, or a related field. Equivalent experience will be considered.
  • 2+ years of experience in system operations within IT infrastructure or cloud services.
  • Hands-on experience in IT hardware replacement.
  • Experience in data centre operations, system administration, or a similar role.
  • Knowledge of server hardware, including GPU cards, CPU configurations, and storage solutions.
  • Understanding of Linux fundamentals and Kubernetes environments.
  • Familiarity with monitoring tools (e.g., Prometheus, Grafana) and logging frameworks.

What You’ll Need to Succeed


  • Oversee the daily operations of GPU clusters and data centre systems.
  • Monitor system health, performance, and capacity using industry-standard tools and frameworks.
  • Respond to and resolve operational incidents, ensuring minimal downtime and maximum availability.
  • Manage the deployment, configuration, and optimisation of GPU servers, network devices, and supporting infrastructure (e.g., CPU servers and storage).
  • Perform hardware diagnostics and preventative maintenance for GPU servers, storage, and networking equipment.
  • Troubleshoot system issues related to hardware, operating systems, and applications.
  • Work closely with cross-functional teams, including network engineers, system administrators, and developers, to support AI workloads.
  • Maintain accurate documentation for system configurations, processes, and incident reports.
  • Implement and enforce security best practices in system operations.
  • Identify and propose improvements to enhance system performance, reduce costs, and optimise resource utilisation.
What You’ll Get in Return



  • Attractive Employee Incentive Scheme + Bonuses + Allowances
  • Professional training and development programme
  • Medical Insurance


What You’ll Need to Do Now

If you think this job is for you, what are you waiting for? Hit "apply now" for more details or a confidential discussion. Please contact Julian Yew at Hays on +603-5870-5003
Or email
Julian.Yew@hays.com.my.

At Hays, we value diversity and are passionate about placing people in a role where they can flourish and succeed. We actively encourage people from diverse backgrounds to apply.

Summary

Job Type
Permanent
Industry
Technology & Internet Services
Location
Malaysia
Specialism
Infrastructure
Pay
Basic + Allowances + Benefits
Ref:
1275722

Talk to a consultant

Talk to Julian Yew, the specialist consultant managing this position, located in Sunway
Corporate Suite 19-01 at Sunway Resort Hotel,, Persiaran Lagoon,

Telephone: +60358705003

Similar jobs to Data Centre Operations Engineer

  • Salesforce Administrator

    Salesforce Administrator role with a global fintech
    Malaysia
  • Head of IT Infrastructure

    Head of IT Infrastructure
    MalaysiaUp to RM20k basic per month.
  • Data Centre Portfolio Manager

    Portfolio Manager managing large scale construction of data centre projects from end to end
    MalaysiaCompetitive Senior Level salary package
  • Data Analytics, Assistant Manager

    A prominent financial institution that prioritises digital innovation is looking for an AM, Data Analytics.
    Malaysia
  • Head of Data Governance

    Prominent financial institution that prioritises digital innovation looking for a Head of Data Governance.
    Malaysia