Senior Devops Engineer
8 hours ago
Aethir is the leading DePIN, enterprise-grade, AI-focused GPU-as-a-Service provider in the market. By leveraging a highly distributed cloud computing infrastructure, we help GPU providers serve AI and gaming customers at scale. Our mission is to deliver powerful AI chips for enterprise clients while supporting cloud gaming for hundreds of thousands of users worldwide—all under a decentralized cloud architecture that brings compute power directly to the community.
We are looking for a Senior DevOps Engineer (Site Reliability Engineer) to join our new headquarters in Kuala Lumpur, Malaysia. In this role, you will be responsible for maintaining, optimizing, and scaling our production systems to ensure high availability, reliability, and performance across our decentralized compute network. You'll play a key part in supporting mission-critical infrastructure for our AI and cloud gaming customers globally.
Key Responsibilities:
- Monitor, Review, and Respond to Faults: Take on the responsibility of monitoring, reviewing, responding to faults, troubleshooting, resolving, and subsequently optimizing the production system.
- System Architecture and Performance: Continuously monitor and review the system architecture, process logic, system performance, stability, and other technical areas and indicators to ensure their rationality.
- Coordination with Business Team: Drive the business team in resolving any issues related to operations and maintenance.
- Production Failure Response: Respond promptly to production failures, acting as the overall coordinator for resolution.
- Collaborative Problem-Solving: Organize relevant R&D, operations and maintenance, and product teams to collaboratively investigate and resolve problems.
- Failure Response Time: Responsible for the failure response time and resolution time, ensuring timely resolution of issues.
- Case Studies and Optimization: Conduct case studies on production issues and follow up with optimizations to improve system performance and stability.
- Documentation: Maintain comprehensive documentation of system architecture, processes, and troubleshooting procedures.
- Continuous Improvement: Identify areas for improvement in the operations and maintenance processes and implement necessary changes.
- Bachelor's degree in Computer Science, Engineering, or a related field.
- Experience in operations and maintenance development, preferably in a cloud computing or AI-focused environment.
- Strong understanding of system architecture, performance monitoring, and troubleshooting methodologies.
- Excellent communication and collaboration skills.
- Ability to work in a fast-paced, startup environment.
- Proficiency in Kubernetes (K8S), CI/CD, and Docker.
- Expertise in AWS (VPC, S3, EC2, etc.) or Python (one of the two).
- Responsible for building the operations and maintenance infrastructure platform and handling core business operations.
- Management experience is a plus, but not required.
- Prior experience working in structured environments such as Huawei, ZTE, or banking institutions is preferred.
- Fluency in Mandarin is mandatory (written and spoken)
- Hypergrowth Startup Environment
- Fantastic Career Progression Opportunities
- Work within a Global and Local Team
- Collaborative and innovative work environment with opportunities to contribute to cutting-edge projects.
-
DevOps Engineer
2 days ago
Petaling Jaya, Selangor, Malaysia Hitachi eBworx Sdn. Bhd. Full time 90,000 - 120,000 per yearJoin Us as a DevOps EngineerWe're looking for a skilled DevOps Engineer with expertise in AWS and Kubernetes to design and maintain resilient, scalable cloud infrastructure for microservices. You'll work with cutting-edge tools, collaborate with cross-functional teams, and play a key role in ensuring our applications are reliable, secure, and...
-
Devops Engineer
9 hours ago
Petaling Jaya, Selangor, Malaysia Aethir Full time 80,000 - 120,000 per yearAethir is the leading DePIN, enterprise-grade, AI-focused GPU-as-a-Service provider in the market. By leveraging a highly distributed cloud computing infrastructure, we help GPU providers serve AI and gaming customers at scale. Our mission is to deliver powerful AI chips for enterprise clients while supporting cloud gaming for hundreds of thousands of users...
-
DevOps Engineer
9 hours ago
Petaling Jaya, Selangor, Malaysia atQuest Full timeAbout the RoleWe are looking for a DevOps Engineer who can manage our current on-premise deployment operationswhile helping us transition to a hybrid modern architecture.You'll handle .NET 4.0 / 6.0 applications today and help pave the way for .NET 7 & 8 with Linux-readydeployments tomorrow.Our systems run on MSSQL Server Enterprise (on-prem), and we plan to...
-
DevOps Manager
8 hours ago
Petaling Jaya, Selangor, Malaysia Aethir Full time $100,000 - $120,000 per yearAethir is the only Enterprise-grade AI-focused GPU-as-a-service provider in the market. Its decentralized cloud computing infrastructure allows GPU providers (containers) to meet Enterprise clients who need powerful GPU chips for professional AI/ML tasks. Thanks to a constantly growing network of over 40,000 top-shelf GPUs, including 3,000 NVIDIA H100s,...
-
DevOps Engineer
2 days ago
Petaling Jaya, Selangor, Malaysia Hilti (Malaysia) Sdn Bhd Full time 120,000 - 240,000 per yearWhat's the role? The DevOps Engineer is responsible for managing the full DevOps lifecycle, including system operations, monitoring, deployment automation, and infrastructure as code (IaC). This role ensures the availability, performance, scalability, and security of cloud and on-premises systems. The ideal candidate is skilled in AWS technologies,...
-
DevOps Engineer
2 days ago
Petaling Jaya, Selangor, Malaysia Hong Leong Assurance Berhad Full time 60,000 - 80,000 per yearWe are looking for an experienced DevOps engineer to help us manage as well as reinvent our services platform. The Services team is responsible for building powerful APIs with disruptive features that power our various apps and services. HLA is going on a journey of modernization and innovation, in order to be a disruptive force in the insurance industry. We...
-
Internship - DevOps Engineer
2 days ago
Petaling Jaya, Selangor, Malaysia Hilti (Malaysia) Sdn Bhd Full time 40,000 - 60,000 per yearWhat's the role? Join us as an Intern in our IT team and you will work on exciting projects as a DevOps Engineer. As a member of our international and multidisciplinary team, you will gain experience in global IT project and solution management, improve your practical expertise, and solve real-life challenges in an international enterprise.The start date...
-
DevOps Engineer – Monitoring
8 hours ago
Petaling Jaya, Selangor, Malaysia CFI Financial Group Full time 80,000 - 120,000 per yearWho are we?CFI Financial Group is an award-winning trading provider, possessing more than 25 years of experience with multiple offices around the world including London, Larnaca, Beirut, Amman, Dubai, Kuwait, Port Louis, and others.Check out more about CFI here.CFI is hiring Make your mark in the online trading industry.Are you looking to pursue a career in...
-
Internship - DevOps Engineer
2 days ago
Petaling Jaya, Selangor, Malaysia Hilti Group Full time 4,000 - 8,000 per yearWHAT'S THE ROLE?Join us as an Intern in our IT team and you will work on exciting projects as a DevOps Engineer. As a member of our international and multidisciplinary team, you will gain experience in global IT project and solution management, improve your practical expertise, and solve real-life challenges in an international enterprise. The start date...
-
Senior Infrastructure Engineer
2 days ago
Petaling Jaya, Selangor, Malaysia Grab Full time 120,000 - 180,000 per yearCompany DescriptionAbout Grab and Our WorkplaceGrab is Southeast Asia's leading superapp. From getting your favourite meals delivered to helping you manage your finances and getting around town hassle-free, we've got your back with everything. In Grab, purpose gives us joy and habits build excellence, while harnessing the power of Technology and AI to...