Current jobs related to Site Reliability Engineer - Kuala Lumpur, Kuala Lumpur - Embedded LLM


  • Kuala Lumpur, Kuala Lumpur, Malaysia Agensi Pekerjaan BTC Sdn Bhd Full time

    Job OpportunityOpen Position: Site Reliability EngineerAgensi Pekerjaan BTC Sdn Bhd is seeking a skilled Site Reliability Engineer to support the development and operation of full-stack software applications.Key Responsibilities:Collaborate with development teams to design, implement, and maintain scalable and reliable system infrastructure.Develop and...


  • Kuala Lumpur, Kuala Lumpur, Malaysia Agensi Pekerjaan BTC Sdn Bhd Full time

    Job OpportunityOpen Position: Site Reliability EngineerAgensi Pekerjaan BTC Sdn Bhd is seeking a highly skilled Site Reliability Engineer to join their team in Kuala Lumpur.Key Responsibilities:Design and implement operational support for full-stack software applications to ensure high availability and performance.Collaborate with development operations...


  • Kuala Lumpur, Kuala Lumpur, Malaysia TIME's group Full time

    Job Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at TIME's group. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based systems.Key Responsibilities:System Architecture and Design: Collaborate with...

  • Reliability Engineer

    2 weeks ago


    Kuala Lumpur, Kuala Lumpur, Malaysia The Chemical Engineer Full time

    About UsThe Chemical Engineer is a leading provider of innovative solutions in the chemical industry. Our mission is to deliver high-quality products and services that meet the evolving needs of our customers.We are a dynamic and diverse organization that values technical excellence, collaboration, and innovation. Our team of experts is dedicated to...


  • Kuala Lumpur, Kuala Lumpur, Malaysia AirAsia Full time

    Job Title: Site Reliability EngineerAirAsia Software Engineering Team (AASET) is a technology centre that designs and creates custom-built solutions for the group's airline and digital businesses. It is a global initiative to drive its digital transformation.Key Responsibilities:Design and build applications around customer needs to ensure the platform is...


  • Kuala Lumpur, Kuala Lumpur, Malaysia Guidewire Software Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our Guidewire Cloud Platform team. As a key member of our SRE team, you will be responsible for ensuring the reliability, performance, and scalability of our cloud-based solutions.Key ResponsibilitiesCollaborate with development teams to troubleshoot and resolve complex issues,...


  • Kuala Lumpur, Kuala Lumpur, Malaysia Guidewire Software Full time

    Transform Insurance with Guidewire Cloud PlatformWe are seeking a skilled Site Reliability Engineer to join our team and contribute to the development and evolution of our SRE practice for applications running on our Guidewire Cloud Platform.Key ResponsibilitiesCollaborate with development teams to troubleshoot and resolve issues, minimizing customer...


  • Kuala Lumpur, Kuala Lumpur, Malaysia Razer Full time

    Job OverviewRazer is seeking a skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud infrastructure.Key ResponsibilitiesDesign and Implement Infrastructure as Code (IaC): Develop and maintain IaC using Terraform and...


  • Kuala Lumpur, Kuala Lumpur, Malaysia AirAsia Full time

    About the RoleAirAsia is seeking a highly skilled Site Reliability Engineer to join our Software Engineering Team. As a Site Reliability Engineer, you will be responsible for ensuring the smooth operation of our global technology infrastructure.Key ResponsibilitiesTechnical Leadership: Provide technical guidance and leadership to the engineering team to...


  • Kuala Lumpur, Kuala Lumpur, Malaysia The Chemical Engineer Full time

    About ExxonMobilExxonMobil is a leading energy and chemical company that is committed to addressing the dual challenge of meeting the world's growing demand for energy while reducing environmental impacts. We are a global organization with a diverse workforce and a strong presence in Malaysia.Job SummaryWe are seeking an experienced Manufacturing Control...


  • Kuala Lumpur, Kuala Lumpur, Malaysia The Chemical Engineer Full time

    About UsThe Chemical Engineer is a leading company in the energy and chemical industry, driven by innovation and a commitment to sustainability. Our vision is to advance modern living and a net-zero future through energy innovations.We are a diverse workforce fueled by pride in what we do and what we stand for. Our success is the result of the talent,...


  • Kuala Lumpur, Kuala Lumpur, Malaysia Razer Full time

    Job Summary:Razer is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for designing, implementing, and maintaining our cloud infrastructure using Infrastructure as Code (IaC) tools.Key Responsibilities:Design and Implement Cloud Infrastructure: Collaborate with our development and...


  • Kuala Lumpur, Kuala Lumpur, Malaysia WCC Full time

    About WCCWe are a software organization that strives for improving human life. Our product is an advanced Search and Match engine used in solutions for the private and public sector.Our MissionWe provide software that matters. Our team believes unity is one of our strengths. We focus on talent and possibilities, not limitations.Job SummaryWe are seeking a...


  • Kuala Lumpur, Kuala Lumpur, Malaysia The Chemical Engineer Full time

    {"h1": "Unlock the Future of Manufacturing with ExxonMobil", "p": "At ExxonMobil, we're pushing the boundaries of innovation to create a more sustainable and efficient future. As a Manufacturing Digital Solutions Engineer, you'll play a critical role in driving this transformation by leveraging cutting-edge technologies to optimize our manufacturing...


  • Kuala Lumpur, Kuala Lumpur, Malaysia Guidewire Software Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer III to join our team at Guidewire Software. As a key member of our SRE-Application team, you will play a critical role in ensuring the reliability, performance, and scalability of applications running on our Guidewire Cloud Platform.Key ResponsibilitiesCollaborate with Development Teams:...


  • Kuala Lumpur, Kuala Lumpur, Malaysia TIME's group Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at TIME's group. As a key member of our engineering team, you will be responsible for designing, building, and maintaining scalable and reliable cloud infrastructure on Google Cloud Platform.Key ResponsibilitiesSystem Architecture and DesignCollaborate with...


  • Kuala Lumpur, Kuala Lumpur, Malaysia The Chemical Engineer Full time

    About The Chemical EngineerWe are a leading company in the chemical industry, committed to innovation and excellence in our operations. Our team is passionate about delivering high-quality products and services that meet the evolving needs of our customers.Job SummaryWe are seeking an experienced Manufacturing Digital Solutions Engineer to join our team. As...


  • Kuala Lumpur, Kuala Lumpur, Malaysia The Chemical Engineer Full time

    About The Chemical EngineerWe are a leading chemical company dedicated to delivering innovative solutions that meet the evolving needs of our customers. Our team of experts is passionate about developing and implementing cutting-edge technologies that drive business growth and improve operational efficiency.Job SummaryWe are seeking an experienced...

  • Site Engineer

    1 month ago


    Kuala Lumpur, Kuala Lumpur, Malaysia Randstad Malaysia Full time

    About the CompanyYour potential employer is a Specialist Construction and Engineering company (MNC). Their projects encompasses bridges, buildings, and other structures, including bridge bearings, joints, structural repairs, and a variety of specialised construction methods.About the Job- Site Management: Oversee and coordinate construction activities...


  • Kuala Lumpur, Kuala Lumpur, Malaysia The Chemical Engineer Full time

    About the RoleWe are seeking a highly experienced Senior Process Engineer to join our team at The Chemical Engineer. As a key member of our operations team, you will be responsible for delivering process engineering design work for downstream business in the oil and gas industry.Key ResponsibilitiesPerform process engineering design work, including design...

Site Reliability Engineer

3 months ago


Kuala Lumpur, Kuala Lumpur, Malaysia Embedded LLM Full time

Our mission is to provide developers with a suite of intuitive tools and platforms that simplify the process of integrating LLMs into their software projects. We are building an open-source toolkit that empowers developers to effortlessly build cutting-edge, AI-powered applications. We're at the forefront of generative AI innovation, creating tools that streamline LLM integration, management, and deployment for developers around the world.

The Opportunity:

As our SRE, you'll be the guardian of our cutting-edge, LLM-powered developer platforms. You'll work to ensure maximum availability and efficiency, directly impacting the experiences of developers worldwide.

What You'll Do:

  • Architect for Resilience: Design and implement highly available, scalable systems optimized for LLM workloads.
  • Champion Observability: Build robust monitoring, logging, and alerting systems to gain deep insights into system health and potential issues.
  • Automate Everything: Drive efficiency through infrastructure-as-code (IaC) and robust CI/CD pipelines.
  • Mitigate Risk: Proactively implement disaster recovery, security best practices, and capacity planning strategies.
  • Collaborate for Innovation: Work closely with developers to understand platform needs and support the integration of new LLM technologies.

Why Join Embedded LLM

  • LLM Frontier: Be at the forefront of a technological revolution, shaping how LLMs transform software development.
  • Open-Source Impact: Contribute to a vibrant open-source community with global reach.
  • High-Growth Environment: Experience rapid growth and the challenges of scaling cutting-edge AI infrastructure.
  • Collaborative Team: Work alongside passionate engineers and pioneers in the LLM space.
Job Requirements

What We're Looking For

  • SRE Mindset: 3+ years of experience in Site Reliability Engineering, DevOps, or similar roles.
  • Cloud Native: Deep understanding of cloud architecture (ideally AWS, Azure, or GCP) and containerization technologies (Docker, Kubernetes).
  • Automation Ace: Strong scripting skills (Python, Ansible, Bash) and expertise in IaC tools (Terraform, CloudFormation, etc.).
  • Data-Driven: Proficiency in monitoring and observability tools (Prometheus, Grafana, etc.).
  • LLM Curious: Interest in LLMs and their unique infrastructure requirements is a plus.

Nice to Haves:

  • LLMOps Understanding: Familiarity with the operational challenges of deploying and managing large language models.
  • GPU Expertise: Experience working with GPU-accelerated infrastructure for AI workloads.
Skills

DevOps

Site Reliability Engineering

Cloud Computing

Docker (Software)

Ansible

Bash (Scripting Language)

Cloud-Native Computing

Company Benefits

Benefit from a supportive and team-focused culture that encourages collaboration and values each member's contributions.

We prioritize your professional development, offering opportunities for learning and advancement to help you achieve your career goals.

Additional InfoExperience Level

2 - 20 Years of Experience

Entry Level

Job Specialisation

Computer Engineering, Hardware / Network / Infrastructure (On-Premises / Cloud), System & IT Helpdesk / Database Administrator

#J-18808-Ljbffr