CloudOps Engineer

2 weeks ago


Petaling Jaya, Selangor, Malaysia MY E.G. Services Berhad (MYEG) Full time 60,000 - 120,000 per year

As a CloudOps Engineer, you will manage and optimize multicloud environments across Huawei Cloud, AWS, and Azure. Your role involves deploying, monitoring, and maintaining cloud infrastructure, ensuring high availability, scalability, and security across platforms.

You will handle configuration management, automate routine tasks, and support continuous integration and deployment pipelines. Additionally, you'll monitor performance, troubleshoot issues, and implement best practices for cost control and security compliance.

Collaborating with development and DevOps teams, you'll contribute to a resilient, efficient, and reliable cloud infrastructure that supports business-critical applications.

Responsibilities

1. Cloud Infrastructure Management


• Deploy, configure, and manage cloud resources across Huawei Cloud, AWS, and Azure.


• Ensure high availability, performance, and scalability of cloud-based applications.

2. Automation and Optimization


• Automate infrastructure provisioning, scaling, and routine operational tasks using Infrastructure as Code (IaC) tools


• Optimize cloud resource utilization for cost efficiency without compromising performance.

3. Monitoring and Incident Response


• Implement monitoring, alerting, and logging across multicloud environments to proactively identify and resolve performance bottlenecks and issues.


•Troubleshoot and resolve cloud infrastructure issues, escalating to appropriate teams as needed.

4. Security and Compliance


• Enforce cloud security policies, ensuring adherence to best practices and compliance standards across all environments.


• Conduct regular security assessments, vulnerability scans, and implement required patches and updates.

5. CI/CD and DevOps Support


• Integrate and maintain CI/CD pipelines across cloud platforms, supporting development teams with deployment and release processes.


• Work closely with DevOps and development teams to ensure smooth and efficient workflow across environments.

6. Documentation and Reporting


• Document configurations, processes, and procedures for cloud environments to support operational transparency and knowledge transfer.


• Generate and analyze usage reports, cost summaries, and optimization recommendations for multicloud management.

7. Disaster Recovery and Backup


• Design and implement disaster recovery and backup strategies to ensure data integrity and business continuity across clouds.


• Perform regular testing of recovery protocols and backup systems to validate effectiveness.

8. Collaboration and Stakeholder Communication


• Collaborate with cross-functional teams, including development, security, and operations, to align cloud infrastructure with business needs.


• Communicate effectively with stakeholders regarding cloud strategy, usage, and improvements.

9. Research and Development


• Stay updated on industry trends and emerging technologies in multicloud management, evaluating new tools and services for potential integration.


• Contribute to continuous improvement initiatives to enhance the resilience, performance, and cost-effectiveness of cloud operations.

Qualifications

  1. Bachelor's degree in Computer Science, IT, or related field; 3+ years of cloud infrastructure experience with Huawei Cloud, AWS, and Azure.
  2. Proficient in Infrastructure as Code and automation scripting (Python, Bash, PowerShell).
  3. Strong knowledge of monitoring, logging, and performance tools (e.g. CloudWatch, Azure Monitor).
  4. Experience with CI/CD tools and practices, integrating pipelines across multicloud environments.
  5. Familiar with cloud security best practices, IAM, and compliance standards (e.g., SOC 2, ISO
  6. Effective troubleshooting and problem-solving skills for complex multicloud issues.
  7. Excellent communication and collaboration skills across technical and non-technical teams.
  8. Preferred: Cloud certifications (AWS, Azure, Huawei), cost management experience, and disaster recovery knowledge.